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Abstract 



In this work, we show how to obtain for non-compact manifolds the results that 
have already been done for Monge Transport Problem for costs coming from Tonelli 
Lagrangians on compact manifolds. In particular, the already known results for 
a cost of the type d'^,r > 1, where d is the Riemannian distance of a complete 
Riemannian manifold, hold without any curvature restriction. 

1 Introduction 

Monge transportation problem is more than 200 hundred years old, see [3l]. It has 
generated a huge amount of work. It is impossible for us to give all the relevant references 
to previous works. There are now several books and surveys that can help the reader 
through the literature, see for example among others the books [121 IH EH SD 112], and 
the surveys [21 UHl ESj. Both authors benefited a lot from all these sources. 

Originally Monge wanted to move, in 3-space, rubble (deblais) to build up a mound 
or fortification (remblais) minimizing the cost. Now if the rubble consist of masses 
mi,...,m„ at locations {xi,...x„}, one should move them into another set of posi- 
tions {yi, . . . ,yn} by minimizing the traveled distance taking into accounts the weights. 
Therefore one should try to minimize 



over all bijections T : {xi, . . . Xn} {yi, ■ ■ ■ , yn}, where d is the usual Euclidean distance 
on 3-space. 
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Nowadays, one would be more interested in minimizing the energy cost rather than 
the traveled distance. Therefore one would try rather to minimize 



J2mi(fixi,Tixi)). (2) 

i=l 

Of course, one would like to generalize to continuous rather than just discrete distribu- 
tions of matter. Therefore Monge transportation problem is now stated in the following 
general form: given two probability measures fi and u, defined on the measurable spaces 
X and Y, find a measurable map T : X ^ Y with 

Tj/i = I/, (3) 

i.e. 

u{A) = fi (T-^iA)) \/AcY measurable, 
and in such a way that T minimize the transportation cost. This last condition means 



c{x,T{x)) dfi{x) = min < / c{x, S{x)) dfi{x) 

where c : X x F ^ M is some given cost function, and the minimum is taken over all 
measurable maps S : X ^ Y with S'jj/i = z/. When condition is satisfied, we say that 
T is a transport map, and if T minimize also the cost we call it an optimal transport map. 

It is easy to build examples where the Monge problem is ill-posed simply because there 
is no transport map: this happens for instance when /i is a Dirac mass while u is not. 
This means that one needs some restrictions on the measures /i and u. 

Even in Euclidean spaces, and the cost c equal to the Euclidean distance or its square, 
the problem of the existence of an optimal transport map is far from being trivial. Due 
to the strict convexity of the square of the Euclidean distance, case ([2]) above is simpler 
to deal with than case ([T]). The reader should consult the books and surveys given above 
to have a better view of the history of the subject, in particular Villani's second book 
on the subject [12]. However for the case where the cost is a distance, like in ([T]), one 
should cite at least the work of Sudakov [32], Evans-Gangbo [IH], Feldman-McCann [23], 
Caffarelli-Feldman-McCann [11], Ambrosio-Pratelli [5], and Bernard-Buffoni [8]. For the 
case where the cost is the square of the Euclidean or of a Riemannian distance, like in ([2]), 
one should cite at least the work of Knott-Smith [31J, Brenier [9], Rachev-Riischendorf 
[3H], Gangbo-McCann [27], McCann [33], and Bernard-Buffoni [7]. 

Our work is related to the case where the cost behaves like a square of a Riemannian 
distance. It is strongly inspired by the work of Bernard-Buffoni [7]. In fact, we prove the 
non-compact version of this last work adapting some techniques that were first used in the 
euclidean case in [1] by Ambrosio,Gigli, and Savare. We show that the Monge transport 
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problem can be solved for the square distance on any complete Riemannian manifold 
without any assumption on the compactness or curvature, with the usual restriction on 
the measures. Most of the arguments in this work are well known to specialists, at least 
in the compact case, but they have not been put together before and adapted to the case 
we treat. Of course, there is a strong intersection with some of the results that appear 
in [12]. For the case where the cost behaves like the distance of a complete non-compact 
Riemannian manifold see the work [21] of the second author. 

We will prove a generalization of the following theorem (see Theorems 14.21 and 14.31) : 

Theorem 1.1. Suppose that M is a connected complete Riemannian manifold, whose 
Riemannian distance is denoted by d. Suppose that r > 1. If fi and v are probability 
(Borel) measures on M, with fi absolutely continuous with respect to Lebesgue measure, 
and 



for some given xq G M, then we can find a transport map T : M ^ M, with T^jj, = u, 
which is optimal for the cost on M x M. Moreover, the map T is uniquely determined 
fi-almost everywhere. 

We recall that a measure on a smooth manifold is absolutely continuous with respect 
to the Lebesgue measure if, when one looks at it in charts, it is absolutely continuous with 
respect to Lebesgue measure. Again we note that there is no restriction on the curvature 
of M in the theorem above. 

The paper is structured as follows: in section [2] we recall some known results on the 
general theory of the optimal transport problem, and we introduce some useful definitions. 
Then, in section |3] we will give very general results for the existence and the uniqueness 
of optimal transport maps (Theorems 13.11 and 13.21 and Complement 13.41 ). In section H] 
the above results are applied in the case of costs functions coming from (weak) Tonelli 
Lagrangians (Theorems 14.21 and 14.31) . In sectional we study the so called "dispacement 
interpolation", showing a countably Lipschitz regularity for the transport map starting 
from an intermidiate time (Theorem 15.11) . Finally, in the appendix, we collect all the 
tecnical results about semi-concave functions and Tonelli Lagrangians used in our proofs. 

Acknowledgements: The authors would like to thank particularly Cedric Villani for 
communicating to them his Saint-Flour Lecture Notes [42j. They also warmly thank the 
anonymous referee for his comments and for pointing out to them Remark 13. 5[ The first 
author learned most of the subject from invaluable informal almost daily conversations 
with Cedric Villani. 
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2 Background and some definitions 



A major advance to solve the Monge transport problem is due to Kantorovich. He pro- 
posed in [29], [30] a notion of weak solution of the transport problem. He suggested to 
look for plans instead of transport maps, that is probability measures j in X x Y whose 
marginals are /i and z/, i.e. 

(7rx)t)7 = and (7ry)jj7 = u, 

where tcx '■ X xY ^ X and iry '■ X x Y —>■ Y are the canonical projections. Denoting by 
H(/i, u) the set of plans, the new minimization problem becomes then the following: 



If 7 is a minimizer for the Kantorovich formulation, we say that it is an optimal plan. 
Due to the linearity of the constraint 7 G H(yU, u), it turns out that weak topologies can 
be used to provide existence of solutions to (HI): this happens for instance whenever X 
and Y are Polish spaces and c is lower semicontinuous (see [37], [HI Proposition 2.1] or 
[42] ) . The connection between the formulation of Kantorovich and that of Monge can be 
seen by noticing that any transport map T induces the plan defined by (Idx x^)tt/^ which 
is concentrated on the graph of T, where the map Idx xT : X ^ X x Y is defined by 



It is well-known that a linear minimization problem with convex constraints, like (jlj), 
admits a dual formulation. Before stating the duality formula, we make some definitions 
similar to that of the weak KAM theory (see [21]): 

Definition 2.1 (c-subsolution). We say that a pair of functions ip : X ^ WU {+00}, 
ijj : Y —>■ U {—00} is a c-subsolution if 



Observe that when c is measurable and bounded below, and {ip, ip) is a c-subsolution 
with ip G L^{fi),ip e L^{i^), then 




Idx ><T(x) = {x,T{x)). 



W{x,y)eXxY, ipiy) - ip{x) < c{x,y). 




If moreover J, 



XxY 



c(x, y) d'j < +00, and 
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then one would obtain the following equality: 



%l){y) - ip{x) = c{x, y) for 7-a.e. (x, y) 



(without any measurability or integrability assumptions on [ip^ip), this is just a formal 
computation). 

Definition 2.2 (Calibration). Given an optimal plan 7, we say that a c-subsolution 
(V9, ■?/') is {c,--^)- calibrated if ip and ip are Borel measurable, and 



Theorem 2.3 (Duality formula). Let X and Y he Polish spaces equipped with proba- 
bihty measures fi and v respectively, c : X xY —>■ M a lower semicontinuous cost function 
bounded from below such that the infimum in the Kantorovitch problem ^ is finite. 
Then a transport plan 7 G n(/i, u) is optimal if and only if there exists a (c, 'j) -calibrated 
subsolution {(p,ip). 

For a proof of this theorem see [38] and [111 Theorem 5.9 (ii)]. 

In this work we study Monge's problem on manifolds for a large class of cost func- 
tions induced by Lagrangians like in [7|, where the authors consider the case of compact 
manifolds. We generalize their result to arbitrary non-compact manifolds. 

Following the general scheme of proof, we will first prove a result on more general 
costs, see Theorem 13.21 In this general result, the fact that the target space for the 
Monge transport is a manifold is not necessary. So we will assume that only the source 
space (for the Monge transport map) is a manifold. 

Let M be a n-dimensional manifold (Hausdorff and with a countable basis), a Polish 
space, c : M X — >■ M a cost function, /x and u two probability measures on M and 
respectively. We want to prove existence and uniqueness of an optimal transport map 
T : M ^ N, under some reasonable hypotheses on c and fi. 

One of the conditions on the cost c is given in the following definition: 

Definition 2.4 (Twist Condition). For a given cost function c{x,y), we define the skew 
left Legendre transform as the partial map 



ip{y) - ^{x) = c{x,y) for 7-a.e. (x,?/). 



A[: M X N ^ T*M, 



Kix,y) = (x. 



dc 
dx 



{x,y)), 



whose domain of definition is 




dc 
dx 



(x, y) exists 
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Moreover, we say that c satisfies the left twist condition if is injective on D(A^). 

One can define similarly the skew right Legendre transform : M x N T*N by 
A^(x,?/) = {y, The domain of definition of is V{Al) = {{x,y) e M x N \ 

^{x,y) exists}. We say that c satisfies the right twist condition if AJJ is injective on 

The usefulness of these definitions will be clear in the section HI in which we will treat 
the case where M = N and the cost is induced by a Lagrangian. This condition has 
appeared already in the subject. It has been known (explicitly or not) by several people, 
among them Gangbo (oral communication) and Villani (see [HI page 90]). It is used in 
[7j, since it is always satisfied for a cost coming from a Lagrangian, as we will see below. 
We borrow the terminology "twist condition" from the theory of Dynamical Systems: if 
/i : M X M — i> M, (x, y) 1—^ h{x,y) is C^, one says that h satisfies the twist condition if 

there exists a constant a > such that > a everywhere. In that case both maps 

oxoy 

A{ : R X R ^ R X R, {x,y) ^ {x,dh/dx{x,y)) and A^; : M x M ^ M x R,{x,y) ^ 
{y, dh/dy{x, y)) are C"^ diffeomorphisms. The twist map / iMxM— >Mx]R associated to 
h is determined by f{xi,Vi) = {x2,V2), where Vi = —dh/dx{xi,X2),V2 = dh/dy{xi,X2), 
which means f{xi,Vi) = A}^ o [Ay~^(a;i, — fi), see [32] or 

We now recall some useful measure-theoretical facts that we will need in the sequel. 

Lemma 2.5. Let M he an n- dimensional manifold, N be a Polish space, and let c : 
M X N R be a measurable function such that x ^ c{x, y) is continuous for any y & N . 
Then the set 

dc 

{{x,y) I —{x,y) exists} is Borel measurable. 
Moreover (x, y) i— >■ ^{x, y) is a Borel function on that set. 

Proof. This a standard result in measure theory, we give here just a sketch of the proof. 
By the locality of the statement, using charts we can assume M = M". Let Tk : M" — > 
be a dense countable family of linear maps. For any j, A; G N, we consider the Borel 

function 



[x, y) : = sup 



\c{x + h,y)- c(x, y) - T^{h) \ 



/i|6(0,|) 1^1 

\c{x + h,y) -c{x,y) - Tk{h)\ 



sup 

|/i|e(o,-),/ieQ" I'*'! 



where in the second equality we used the continuity of a; t— > c(x, y). Then it is not difficult 

dc j 

{{x,y) I inf inf Lj,fc(x,y) = 0}, 



to show that the set of point where ^{x,y) exists can be written as 
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which is clearly a Borel set. 

dc 
dx 



To show that x t— >■ ^(x, y) is Borel, it suffices to note that the partial derivatives 



dc / X . . . , Xj -|- „,..., Xjj, y) c(xi, . . . , Xj, . . . , x^, ?/) 
TT — X, v) = lim — 

are countable limits of continuous functions, and hence are Borel measurable. □ 

Therefore, by the above lemma, 2^(A[,) is a Borel set. If we moreover assume that c 
satisfies the left twist condition (that is, is injective on P(A^)), then one can define 

{A[)-^ : T*M D A[{V{A[)) V{A[) C M x N. 

Then, by the injectivity assumption, one has that A[{V{A[)) is still a Borel set, and (A^)^^ 
is a Borel map (see |T7l Proposition 8.3.5 and Theorem 8.3.7], [22] )■ We can so extend 
(A^)^^ as a Borel map on the whole T*M as 

AMn.., „^_ / iKyHx,p) ifpeT:MnAiiViA[)), 

^"^'^^ \ix,y) Hp eT:M \A[iViA[)), 

where y is an arbitrary point, but fixed point, in A^. 



3 The main result 

In order to have general results of existence and uniqueness of transport maps which are 
sufficiently flexible so that they can be used also in other situations, and to well show 
where measure-theoretic problems enter in the proof of the existence of the transport map, 
we will ffist give a general result where no measures are present (see in the appendix IA.3I 
for the definition of locally semi-concave function and IA.7I for the definition of countably 
{n — 1)-Lipschitz set). 

Theorem 3.1. Let M be a smooth (second countable) manifold, and let N be a Polish 
space. Assume that the cost c : M x A^ — >■ R is Borel measurable, bounded from below, 
and satisfies the following conditions: 

(i) the family of maps x i— > c(x, y) = Cy{x) is locally semi-concave in x locally uniformly 
in y, 

(ii) the cost c satisfies the left twist condition. 

Let {if, ip) be a c-subsolution, and consider the set G M x N given by 

G(^,^) = {{x,y) e M X N \ i){y) - (^(x) = c(x, y)}. 
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We can find a Borel countably [n — 1)-Lipschitz set E G M and a Borel measurable map 
T : M ^ N such that 

C Graph(r) U7rJ^:/(E), 

whercTTM '■ MxN —>■ M is the canonical projection, aiid Graph (T) = {(x,T(x)) | x G M} 
is the graph ofT. 

In other words, if we define P = ttm ,^)) C M the part of G(<^,^) which is above 
P\E is contained a Borel graph. 

More precisely, we will prove that there exist an increasing sequence of locally semi- 
convex functions : M — M, with (f > <fn+i > fn on M, and an increasing sequence of 
Borel subsets Cn such that 

• For X G Cn, the derivative dx^n exists, ipn+i{x) = ipn{x), and d^ipn+i = dxipn- 

• If we set C = U„C„, there exists a Borel countably (n — 1)-Lipschitz set E G M 
such that P\E CC. 

Moreover, the Borel map T : M ^ N is such that 

• For every x E Cn, we have 

(a;,T(a;)) = A^'"^(x,-4<^„), 

where A^'*"*' is the extension of the inverse of A[ defined at the end of section \2i 

Oc 

• If X E P D Cn \ E, then the partial derivative —{x,T{x)) exists (i.e. (x,T(x)) G 
V{A[) ), and 

Oc 

— (x,T(x)) = -dx^Pn- 
In particular, if x E P H Cn \ E , we have 

{x, T{x)) E V{K[) and K[{x, T{x)) = {x - 4<^n). 
Therefore, thanks to the twist condition, the map T is uniquely defined on P\E G C. 

The existence and uniqueness of a transport map is then a simple consequence of the 
above theorem. 

Theorem 3.2. Let M be a smooth (second countable) manifold, let N be a Polish space, 
and consider /i and v (Borel) probability measures on M and N respectively. Assume 
that the cost c : M x N —>■ M is lower semicontinuous and bounded from below. Assume 
moreover that the following conditions hold: 
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(i) the family of maps x \—>- c{x, y) = Cy{x) is locally semi-concave in x locally uniformly 
in y, 

(ii) the cost c satisfies the left twist condition, 

(iii) the measure /i gives zero mass to countably {n — 1)-Lipschitz sets, 

(iv) the infimum in the Kantorovitch problem is finite. 

Then there exists a Borel map T : M ^ N, which is an optimal transport map from fi to 
V for the cost c. Morover, the map T is unique fi-a.e., and any plan 7c G n(yU, u) optimal 
for the cost c is concentrated on the graph ofT. 

More precisely, if [cp, ip) is a (c, '-fc) -calibrating pair, with the notation of Theorem \3.1\ 
there exists an increasing sequence of Borel subsets En, with niUnBn) = 1, such that the 
map T is uniquely defined on B = U„-B„ via 

Oc 

— {x,T{x)) = -d^ifn on Bn, 

and any optimal plan 7 G n(/i, u) is concentrated on the graph of that map T. 
We remark that condition (iv) is trivially satisfied if 

/ c(x, y) dfi{x) diy{y) < +00. 
Jmxn 

However we needed to stated the above theorem in this more general form in order to 
apply it in section (see Remark [5.21) . 

Proof of Theorem \3M Let 7c G n(/i, v) be an optimal plan. By Theorem 12.31 there exists 
a (c, 7)-calibrated pair (v5,V^). Consider the set 

G = = {{x,y) e M X N \ ^(y) - ip{x) = c{x,y)}. 

Since both M and are Polish and both maps ip and ip are Borel, the subset G is a 
Borel subset of M x A^. Observe that, by the definition of (c, 7c)-calibrated pair, we have 
7c(G) = 1. 

By Theorem 13.11 there exists a Borel countably {n — 1)-Lipschitz set E such that 
G \ {7Tm)^^{E) is contained in the graph of a Borel map T. This implies that 

E = TTM {G \ {nM)-\E)) = nM{G) \EcM 

is a Borel set, since it coincides with (Mm xT)^^ (G \ {'n-M)~^{E)) and the map x 1— > 
Id-M y<T{x) = (x,T(x)) is Borel measurable. 
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Thus, recaUing that the first marginal of 7c is ^, by assumption (iii) we get 7c((7rAf)~^(i?)) = 
yu(-E) = 0. Therefore 7c(G' \ {nM)~^{E)) = 1, so that 7c is concentrated on the graph 
of T, which gives the existence of an optimal transport map. Note now that /i(-B) = 
7c(7r-i(5)) > 7c(G \ {ttm)-\E)) = 1. Therefore = 1. Since B = P \ E, 

where P = ttm{G), using the Borel set C„ provided by Theorem I3.H it follows that 
Bn = P r\ Cn \ E = D n Cn is a Borel set with B = U„i?„. The end of Theorem 13.11 shows 
that T is indeed uniquely defined on B as said in the statement. 

Let us now prove the uniqueness of the transport map /i-a.e.. If S is another optimal 
transport map, consider the measures 7t = (Wm xT)^^ and 75 = (Wm xS')#/i. The 
measure 7 = |(7r + 75) ^ n(/i, i/) is still an optimal plan, and therefore must be con- 
centrated on a graph. This implies for instance that S = T /i-a.e., and thus T is the 
unique optimal transport map. Finally, since any optimal 7 G n(yU, u) is concentrated on 
a graph, we also deduce that any optimal plan is concentrated on the graph of T. □ 

Proof of Theorem \3.1[ By definition of c-subsolution, we have (f > —00 everywhere on 
M, and ip < +00 everywhere on N. Therefore, if we define Wn = {ip < n}, we have 
Wn C Wn+i, and UnWn = N. Since, by hypothesis (i), c{x,y) = Cy{x) is locally semi- 
concave in X locally uniformly in y, for each y & N there exist a neighborhood Vy of y 
such that the family of functions {c{-,z))z£Vy is locally uniformly semi-concave. Since 
is separable, there exists a countable family of points {yk)km such that l-ikVy^ = N. We 
now consider the sequence of subsets (Ki)nGN C N defined as 

We have K C Define if n ■ M N hy 

ipn{x) = sup tpiy) - c{x,y) = max ( sup tpiy) - c{x,y)) . 

y&Vn l<k<n \y(zW^nVy^ ^ 

Since ip < n on Kn, and — c is bounded from above, we see that ipn is bounded from above. 
Therefore, by hypothesis (i), the family of functions {ip{y) — c{-,y))y(zw„nVy^ is locally 
uniformly semi-convex and bounded from above. Thus, by Theorem IA.4I and Proposition 
lA.lll of the Appendix, the function ipn is locally semi-convex. Since iply) — if{x) < c{x, y) 
with equality on G'(^^^), and Vn C Vn+i, we clearly have 

fn < fn+i < f everywhere on M. 
A key observation is now the following: 

where Pn = 'nAiiGi^^p^^) fl (M x Vn)). In fact, if x G P„, by the definition of P„ we know 
that there exists a point y^ G Ki such that (x, y^) G By the definition of 

this implies 

if{x) = %}j{yx) - c(x, y^) < ifn{x) < if{x). 
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Since is locally semi-convex, by Theorem IA.8I of the appendix applied to — v^n, it is 
differentiable on a Borel subset such that its complement is a Borel countably 
{n — 1)-Lipschitz set. Let us then define F = (InFn. The complement E = F'^ = UnF^ is 
also a Borel countably (n — 1)-Lipschitz set. We now define the Borel set 

Cn = Fn{x e M \ v?fc(x) = ^n{x) \fk > n}. 

We observe that C„ D P„ n F. 

We now prove that fl ((P„ fl F) x Vn) is contained in a graph. 

To prove this assertion, fix a; G Pn H F. By the definition of P„, and what we said 
above, there exists Ux such that 

ip{x) = (pnix) = ipiy^) - c{x,y^). 

Since x E F, the map z t-^ ipniz) — ipiyx) is differentiable at x. Moreover, by condition 
(i), the map z ^ —c{z,yx) = —Cy^{z) is locally semi-convex and, by the definition of ipn, 
for every z G M, we have (pn{z) — ip{yx) > —c{z, y^), with equality ed, z = x. These facts 

Oc 

taken together imply that —{x,yx) exists and is equal to —dx^n- In fact, working in 

a chart around x, since Cy^ = c{-,yx) is locally semi-concave, by the definition IA.3I of a 
locally semi-concave function, there exists linear map such that 

c{z, yx) < c{x, yx) + lx{z - x) + o{\z - x\), 

for z in a neighborhood of x. Using also that is differentiable at x, we get 

(/9„(x) - i){yx) + dx(pn{z -x) + o{\z - x\) = ipn{z) - i){yx) 

> -c{z, yx) > -c(x, yx) - lc{z - x) + o{\z - x\) 
= (pn{x) - i){yx) - lc{z - x) + o{\z - x\). 

This implies that Ic = —dxfn, and that Cy^ is differentiable at x with differential at x 
equal to —dx^n- Setting now Gx = {y E N \ ^{x) — ip{y) = c{x,y)}, we have just 

dc 

shown that {x} x [Gx H Vn) C 2^(A^) for each x G C„, and also —{x,y) = —dx^Pn, for 

every y E Gx Vn- Recalling now that, by hypothesis (ii), the cost c satisfies the left 
twist condition, we obtain that G^. fl is reduced to a single element which is uniquely 
characterized by the equality 

Oc 

So we have proved that G fl (M x Vn) is the graph over P„ fl P of the map T defined 
uniquely, thanks to the left twist condition, by 

Oc 

— (x,T(x)) = -dxPn 

11 



(observe that, since < (pk for k > n with equahty on P„, we have da;(pn\p„ = dx^Pk\p„ 
for k > n). Since Pn+i 3 Pn, and V„ C Vn+i N, we can conclude that G'(<^_^) is a graph 
over U„P„ n F = P n F (where P = 7rM(G'(^,^)) = U„P„). 

Observe that, at the moment, we do not know that T is a Borel map, since P„ is not 
a priori BoreL Note first that by definition of P„ C Pn+i, we have ipn = V^n+i on P„, and 
they are both differentiable at every point of P„. Since ipn < fn+i everywhere, by the 
same argument as above we get d^^Pn = d^ip^+i for x & Bn- Thus, setting B = U„P„, we 
can extend T to M by 



where ir^ : M x N ^ N is the canonical projection, A^'*"^ is the Borel extension of (A^)^^ 
defined after Lemma I2.5[ and y is an arbitrary but fixed point in N. Obviously, the map 
T thus defined is Borel measurable and extends the map T already defined on P \ P. □ 

In the case where /i is absolutely continuous with respect to Lebesgue measure we 
can give a complement to our main theorem. In order to state it, we need the following 
definition, see |4j, Definition 5.5.1, page 129]: 

Definition 3.3 (Approximate differential). We say that / : M ^ R has an approxi- 
mate differential at x G M if there exists a function h : M ^ M. differentiable at x such 
that the set {/ = h} has density 1 at x with respect to the Lebesgue measure (this just 
means that the density is 1 in charts). In this case, the approximate value of / at x is 
defined as f{x) = h{x), and the approximate differential of / at x is defined as d^f = d^h. 
It is not difficult to show that this definition makes sense. In fact, both h{x), and dxh do 
not depend on the choice of h, provided x is a density point of the set {/ = h}. 

Another characterization of the approximate value /(x), and of the approximate dif- 
ferential dxf is given, in charts, saying that the sets 



have density at x for each e > with respect to Lebesgue measure. This last definition 
is the one systematically used in [22]. On the other hand, for the purpose of this paper. 
Definition 13.31 is more convenient. 

The set points x G M where the approximate derivative d^f exists is measurable; 
moreover, the map x i— >■ dxf is also measurable, see [221 Theorem 3.1.4, page 214]. 

Complement 3.4. Under the hypothesis of Theorem 13.21 if we assume that /j, is abso- 
lutely continuous with respect to Lebesgue measure (this is stronger than condition (Hi) 
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of Theoreni \3.2\) . then for any calibrated pair ip), the function ip is approximatively dif- 
ferentiable fi-a.e., and the optimal transport map T is uniquely determined fi-a.e., thanks 
to the twist condition, by 

Oc 

— (x,T(x)) = -d^if, 

where d^^p is the approximate differential of (p at x. Moreover, there exists a Borel subset 

A G M of full fi measure such that d^ip exists on A, the map x i— > d^cp is Borel measurable 
Oc 

on A, and t— (x, T(x)) exists for x & A (i.e. (x, T(x)) G ^^(A') ). 
ox 

Proof. We will use the notations and the proof of Theorems 13.11 and 13. 2[ We denote 
by An C Bn the set of x G -B„ which are density points for i?„ with respect to some 
measure A whose measure class in charts is that of Lebesgue (for example one can take A 
as the Riemannian measure associated to a Riemannian metric). By Lebesgue's Density 
Theorem X{Bn \ An) = 0. Since is absolutely continuous with respect to Lebesgue 
measure, we have /i(v4„) = fi{Bn), and therefore A = UnAn is of full /i-measure, since 
fi{Bn) y yu(-B) = 1. Moreover, since ip = ipn on Bn, and (pn is differentiable at each 
point of Bn, the function ip is approximatively differentiable at each point of An with 
dx<-P = d^ipn- 

The last part of this complement on measurability follows of course from [22l Theorem 
3.1.4, page 214]. But in this case, we can give a direct simple proof. We choose An C An 
Borel measurable with \ An) = 0. We set A = UnAn- The set A is of full /i measure. 
Moreover, for every x & An, the approximate differential d^ip exists and is equal to d^ipn- 
Thus it suffices to show that the map x ^— d^ipn is Borel measurable, and this follows as 
in Lemma [2.51 □ 

Remark 3.5. We observe that the key steps in the results of this section, translated in a 
bit different language, can be summarized in the following way. Let <^:M— s>RU{+oo} 
be a c-convex function, that is 

(p{x) = snpipiy) - c{x,y) 

for a certain : iV — > M U {— cxd}, and assume ip ^ +oo. Then, if c satisfies hypothesis 
(i) of Theorem l3.lt we can prove the following: 

(1) There exists a non- decreasing sequence ipn of locally semi-convex functions such that 
From this we deduce: 

(2a) There exists a Borel section P{x) of the cotangent bundle T*M and a (n — 1)- 
Lipschitz set E G M such that, if x ^ E, P{x) is the only possible sub-differential 
of ip at X. 
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(2b) The function is approximately different iable a.e. with respect to the Lebesgue 
measure. 



By these facts, the duahty theorem and the semi- concavity of the cost, we can deduce 
the existence and uniqueness of an optimal transport (under the general assumptions of 
Theorem l3.2l we use (2a), while under the assumption that the source measure is absolutely 
continuous with respect to the Lebesgue measure we use (2b)). 

4 Costs obtained from Lagrangians 

Now that we have proved Theorem 13.21 we want to observe that the hypotheses are 
satisfied by a large class of cost functions. 

We will consider first the case of a Tonelli Lagrangian L on a connected manifold (see 
Definition IB. 41 of the appendix for the definition of a Tonelli Lagrangian). For t > 0, the 
cost Ct^L : M X M — i> M associated to L is given by 



where the infimum is taken over all the continuous piecewise curves 7 : [0, t] M, 
with 7(0) = X, and •yit) = y (see Definition IB. 181 of the appendix). 

Proposition 4.1. If L : TM — > R is a Tonelli Lagrangian on the connected manifold M, 
then, for t > 0, the cost Ct^i : M x M — R associated to the Lagrangian L is continuous, 
bounded from below, and satisfies conditions (i) and (ii) of Theorem \3.2[ 

Proof. Since L is a Tonelli Lagrangian, observe that L is bounded below by C, where C 
is the constant given in condition (c) of Definition IB. 41 Hence the cost q^l is bounded 
below by tC. By Theorem IB. 191 of the appendix, the cost Ct^L is locally semi-concave, and 
therefore continuous. Moreover, we can now apply Proposition IA.17I of the appendix to 
conclude that ct^i satisfies condition (i) of Theorem 13.21 

The twist condition (ii) of Theorem 13.21 for follows from Lemma IB. 221 and Propo- 
sition [B:231 ' □ 

For costs coming from Tonelli Lagrangians, we subsume the application of the main 
Theorem 13.21 and its Complement 13.41 

Theorem 4.2. Let L be a Tonelli Lagrangian on the connected manifold M. Fix t > 0, 
IX, V a pair of probability measure on M, with n giving measure zero to countably (n — l)- 
Lipschitz sets, and assume that the infimum in the Kantorovitch problem ([1]) with cost 
Ct^L is finite. Then there exists a uniquely ix-almost everywhere defined transport map 



^ Jo 
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T : M —y M from fi tou which is optimal for the cost Ct^i- Moreover, any plan 7 G 11 (/x, z/), 
which is optimal for the cost Ct^i, verifies 7(Graph(T)) = 1. 

If /i is absolutely continuous with respect to Lebesgue measure, and {ip, ip) is a Ct^L- 
calibrated subsolution for (/x, u), then we can find a Borel set B of full jj, measure, such 
that the approximate differential d^f of (f at x is defined for x E B, the map x ^ d^ip 
is Borel measurable on B, and the transport map T is defined on B (hence ii-almost 
everywhere) by 

T{x) = 7r*0f (x,4v5), 

where n* : T*M ^ M is the canonical projection, and (pf is the Hamiltonian flow of the 
Hamiltonian H associated to L. 

We can also give the following description for T valid on B (hence ^-almost every- 
where): 

r ' ^ 

T{x) = Ti^t (x,gra4((^)), 

L 

where 0^ is the Euler-Lagrange flow of L, and x — > grad^ is the measurable vector 
field on M defined on B by 

Ql^ ^ ^ 

— (x,gra4((^)) = 

Moreover, for every x E B, there is a unique L-minimizer 7 : [0,t] — >■ M, with 7(0) = 

L 

X, ■y(t) = T{x), and this curve 7 is given by 7(3) = 7r0^(x, grad^ (v^)), fo-f < s < t. 

Proof. The first part is a consequence of Proposition 14.11 and Theorem 13.21 When /i is 
absolutely continuous with respect to Lebesgue measure, we can apply Complement 13.41 
to obtain a Borel subset A C M of full /i measure such that, for every x G A, we have 
(a;,T(a;)) gP(A^^ J and 

-^{x,T{x)) = d^ip. 

By Lemma lB.221 and Proposition IB.23| if {x,y) G V{A[^^), then there is a unique L- 
minimizer 7 : [0,t] M, with 7(0) = x,'y{t) = y, and this minimizer is of the form 
7(5) = 7T(j)g{x,v), where vr : TM — ^ M is the canonical projection, and v G T^M is 
uniquely determined by the equation 

L ^ L 

Therefore T{x) = 7r0^(x, grad^. ((/?)), where gTa.d^{ip) is uniquely determined by 

|j^(x,grad^(¥?)) = -^^(x,T(x)) = d^^ip, 



15 



which is precisely the second description of T. The first description of T follows from the 
second one, once we observe that 

— 'L dL —L 

if(a;,grad^((^))) = (a;, — (x, grad^ ((^)) = (x, 4<^) 
7r* = vr, 

where =Sf : TM —>■ T*M is the global Legendre Transform, see Definition IB. 81 of the 
appendix. □ 

We now turn to the proof of Theorem 11.11 which is not a consequence of Theorem 14.21 
since the cost with r > 1 does not come from a Tonelli Lagrangian for r ^ 2. 

Theorem 4.3. Suppose that the connected manifold M is endowed with a Rieniannian 
metric g which is complete. Denote by d the Riemannian distance. If r > 1, and fi 
and V are probability (Borel) measures on M, where /i gives measure zero to countably 
{n — 1)-Lipschitz sets, and 



/ d''{x,Xo) d^{x) < oo and / 
Jm Jm 



[x, Xq) du{x) < oo 



for some given Xq G M , then we can find a transport map T : M M, with T^j^fi = u, 
which is optimal for the cost d^ on M x M. Moreover, the map T is uniquely determined 
fi-almost everywhere. 

If /i is absolutely continuous with respect to Lebesgue measure, and (y?, ip) is a cali- 
brated subsolution for the cost d^{x, y) and the pair of measures {fi, v), then the approxi- 
mate differential d^f of ip at x is defined fi-almost everywhere, and the transport map T 
is defined ^-almost everywhere by 

N / grad^(<^) ^ 

T(x)=exp^/ 



VV(-i)||grad'(<^)||i^-')/(^-') 

' 9 

where the approximate Riemannian gradient grad {(p) of (p is defined by 

^^(grad^. ((/?),-) = 

and exp : TM — ^ M is the exponential map of g on TM, which is globally defined since 
M is complete. 

Proof. We first remark that 

d^'ix, y) < [d{x, xo) + d{xo, y)Y 

< [2max{d{x,Xo),d{xo,y))Y 
<2'-[d{x,xor + d{y,xoY]. 
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Therefore 



/ cr{x,y) dfi{x)diy{y) < / 2'^[d{x, XqY + d{y, xqY] dfi{x)diy{y) 

'mxM JmxM 



d^{x, Xq) dfi{x) + T' / (S\y, Xq) dv{%j) < oo, 
M Jm 

and thus the infimum in the Kantorovitch problem (jlj) with cost d^ is finite. 

By Example IB.Sj the Lagrangian Lr^g{x,v) = \\v\\l = gx{v^vY^'^ is a weak Tonelli 
Lagrangian. By Proposition IB.24"t the non-negative and continuous cost d'^{x,y) is pre- 
cisely the cost Ci^L^^. Therefore this cost is locally semi-concave by Theorem IB. 191 By 
Proposition lA.lTl this implies that d'^{x,y) satisfies condition (i) of Theorem 13. 2[ The 
fact that the cost d'^{x, y) satisfies the left twist condition (ii) of Theorem 13.21 follows from 
Proposition IB. 241 Therefore there is an optimal transport map T. 

If the measure /i is absolutely continuous with respect to Lebesgue measure, and {ip, ip) 
is a calibrated subsolution for the cost d'^{x,y) and the pair of measures (/x, z^), then by 
Complement 13.41 for /i- almost every x, we have {x,T{x)) G ^^(Aii^^ ), and 

Since (x,T(x)) is in ©(A^^^ ), it follows from Proposition IB. 241 that T(x) = Tr(j)l{x,Vx), 
where vr : TM — ^ M is the canonical projection, the fiow (pf is the geodesic fiow of g on 
TM, and Vx € T^M is determined by 

— (x,r(x)) = —{x,v^), 

ox ov 

or, given the equality above, by 



dLr^g 



X,Vx) = dx(p. 



dv 

Now the vertical derivative of Lr^g is computed in Example IB. 51 

dLr g, < II 1 1 r-2 / \ 

——(^x,v)=r\\v\\^ gx{v,-)- 
Hence v^ E T^M is determined by 



r\\vx\\l ^gx{vx, ■) = dx<p = gx{gTadl{<p), ■). 



This gives the equality 
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from which we easily get 



r 



/(-i)||grad:(¥.)||i' 



(r-2)/(r-l) • 



Therefore 



grad^((^) 



r 



/(-i)||grad:(vp)||i' 



(r-2)/(r-l) 



By definition of the exponential map exp : TM — > M, we have exp^{v) = vr^f (x, f ), and 
the formula for T{x) follows. □ 

5 The interpolation and its absolute continuity 

For a cost ct^i coming from a Tonelli Lagrangian L, Theorem 14.21 shows not only that 
we have an optimal transport map T but also that this map is obtained by following an 
extremal for time t. We can therefore interpolate the optimal transport by maps Tg where 
we stop at intermediary times s G [0,t]. We will show in this section that these maps are 
also optimal transport maps for costs coming from the same Lagrangian. Let us give now 
precise definitions. 

For the sequel of this section, we consider L a Tonelli Lagrangian on the connected 
manifold M. We fix t > and /iq and fit two probability measures on M, with /iq 
absolutely continuous with respect to Lebesgue measure, and such that 



We call Tt the optimal transport map given by Theorem 14.21 for (q^l, /iq, /it). We denote 
by {'^,4') a fixed (q^^,, 7j)-calibrated pair. Therefore 7t = (Hm xTt)^fiQ is the unique 
optimal plan from /iq to fif By Theorem 14.21 we can find a Borel subset B G M such 
that: 

• the subset B is of full /xq measure; 

• the approximate ddifferential d^f exists for every x G B, and is Borel measurable 




on B ; 



• the map Tt is defined at every x G B, and we have 



Tt{x) = 7r0f(x,grad^(¥?)), 
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where is the Euler- Lagrange flow of L, tt : TM — > M is the canonical projection, 

— — L 

and the Lagrangian approximate gradient x i— > grad^. ((/?)) is defined by 

Q]^ , ^ 

— (x,grad^((^)) = 4<^; 

• for every x E B, the partial derivative —{x,Tt(x)) exists, and is uniquely defined 

by 

dc . rn r W 1 

—{x,Tt{x)) = -dx(p; 

• for every x & B there exists a unique L-minimizer '■ [0, t] ^ M between x and 
Tt{x). This L-minimizer 7^; is given by 

Vs e [0, ti -tx{s) = 7r0f (x, gT^^'(¥^)); 

• for every x & B, we have 

7P{Tt{x))-ip{x)^Ct,L{x,Tt{x)). 

For s G [0,t], we can therefore define an interpolation Tg : M ^ M defined on B, and 
hence /iq almost everywhere, by 

T L 

Ts{x) =7;,(s) =7r0,(x,grad^((^)). 

Each map Tg is Borel measurable. In fact, since the global Legendre transform is a 
homeomorphism and the approximate differential is Borel measurable, the Lagrangian 

L 

approximate gradient grad {ip) is itself Borel measurable. Moreover the map tt^^ : TM — > 
M is continuous, and thus Tg is Borel measurable. We can therefore define the probability 
measure /ig = Tg^iiQ on M, i.e. the measure ^g is the image of //q under the Borel 
measurable map Tg. 

Theorem 5.1. Under the hypotheses above, the maps Tg satisfies the following properties: 

(i) For every s e]0, t[, the map Tg is the (unique) optimal transport maps for the cost 
Cg^L 8i,nd the pair of measures {/Iq, /ig). 

(ii) For every s G]0,t[, the map Tg : B ^ M is injective. Moreover, if we define 
Cs,L{x,y) = Cg^L{y,x), the inverse map T~^ : Tg{B) B is the (unique) optimal 
transport map for the cost Cg^L ^.nd the pair of measures {fig, /iq), and it is countably 
Lipschitz (i.e. there exist a Borel countable partition ofM such that T~^ is Lipschitz 
on each set). 
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(iii) For every s e]0, t[, the measure /ig — T's#/^o absolutely continuous with respect to 
Lebesgue measure. 

(iv) For every s e]0,t[, the composition Tg — TfTf^ is the (unique) optimal transport 
map for the cost Ct-s,L Bind the pair of measures {/ig , IJ't), and it is countably Lipschitz. 

Proof. Fix s e]0,t[. It is not difficult to see, from tfie definition of q^l, tliat 

Vx, y,z e M, Ct,L{x, y) < Cs,l{x, y) + Ct-s,L{y, z), (1) 

and even tfiat 

Vx, zeM, ct,i{x, z) = inf Cs,l{x, y) + CtsAv^ 

If 7 : [a, 6] — >• M is an L-minimizer, tlie restriction 7|[c,di to a subinterval [c,d] C [a,b] is 
also an L-minimizer. In particular, we obtain 

Vs e]a, b[, C6_„,L(7(a), 7(6)) = c,_„,L(7(a), 7(5)) + C6_,,l(7(s), 7(6)). 

Applying this to the L-minimizer j^, we get 

Vx e ct,i(x, Tt{x)) = Cs,L(a;, ^^(a;)) + Ct-s,L{'^s{x),Tt{x)). (2) 

We define for every s e]0, i[, two probability measures 7^, % on M x M, by 

7s = (Mm xrj#^o and % = {TsXTt)#iio: 

where Hm xT^ and TgXTt are the maps from the subset B of full /Xq measure to M x M 
defined by 

(Mm y<Ts){x) = {x,Ts{x)), 
{T,xTt)ix) = {T,{x),Z{x)). 

Note that the marginals of 7^ arc {fiQ.fis), and those of 7^ are {fig, fit)- We claim that 
Cs,L{x,y) is integrablc for 7^ and M fact, we have C = mfxM L > —00, hence 

Cr,L > Cr. Therefore, the equality (2) gives 

Vx e B, [ct,L{x,Tt{x)) - Ct] = [cs,L{x,Tg{x)) - Cs] + [ct-s,L{Ts{x),Tt{x)) - C{t - s)]. 

Since the functions between brackets are all non-negative, we can integrate this equality 
with respect to /iq to obtain 

/ [ct,L{x,y)-Ct]d-ft^ I [cs,L{x,y)-Cs]d'ys+ [ct_5,L(x, y) - C(i - s)] ^7^. 
Jmxm Jmxm Jmxm 
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But all numbers involved in the equality above are non-negative, all measures are proba- 
bility measures, and the cost Ct^i is 7t integrable since •yt is an optimal plan for {ct^L, fJ-o, f^t), 
and the optimal cost of (q^l, /Wq, /it) is finite. Therefore we obtain that Cs^l is 7s-integrable, 
and Ct-s,L is 7s-integrable. 

Since by definition of a calibrating pair we have (f > —oo and ip < +00 everywhere 
on M, we can find an increasing sequence of compact subsets Kn C M with UnKn = M, 
and we consider Vn = Kn n > —n}, = Kn n < n}, so that U„V"„ = U„V^ = M. 

We define the functions M M by 

z£Vn 

if:1{z) = sup ^(5) - Ct-s,L{z, Z), 

where ip) is the fixed Q^L-calibrated pair. Note that ip^ is bounded from below by 
—n + tmiTM L > —00. Moreover, the family of functions {(p{z) + Cs^l{z, ■))zgv,[ is locally 
uniformly semi-concave with a linear modulus, since this is the case for the family of func- 
tions {cs^l{z, ■))z£V^, by Theorem IB. 191 and Proposition IA.17I It follows from Proposition 
lAlHl that is semi-concave with a linear modulus. A similar argument proves that —if"^ 
is semi-concave with a linear modulus. Note also that, since Vn and are both increasing 
sequences, we have ip^ > ipg'~^^ and <f^~^^ > f^, for every n. Therefore we can define ips 
(resp. xps) as the pointwise limit of the sequence v?" 

Using the fact that {ip,ip) is a Q^L-subsolution, and inequality (1) above, we obtain 

Vx, y,z e M, tjj{y) - CtsA^^ v) < V'(^) + Cs,l{x, z). 
Therefore we obtain ioi x & Vn,y & Vn, z & M 

tpiy) - Ct-sAz, y) < <^"(^) < Vs{z) < i)s{z) < C(^) < V^(a^) + Cs,l(x, z). (3) 

Inequality (3) above yields 

Vx, y,z e M, i){y) - Ct-s,L{z, y) < ^s{z) < i^siz) < ip{x) + c,,l(x, z). (4) 

In particular, the pair {(p,ips) is a Cs,L-subsolution, and the pair {(ps,^^) is a Ct-s,L- 
subsolution. Moreover, ip, ipg, ips and ip are all Borel measurable. 
We now define 

Bn = BnVnnTi\v:^), 

so that UnBn = B has full /iQ-measure. 

If a; G Bn, it satisfies x G \4 and Tt{x) G Vn- From Inequality (3) above, we obtain 

ij{Tt{x)) - Ct.s,L{Ts{x),Tt{x)) < ^:{T,{x)) < ^s{Ts{x)) 

< MTsix)) < ^:{Ts{x)) < ip{x) + Cs,l{x,Ts{x)) 
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Since -B„ C B, for x G -B„, we have ip(Tt{x)) — (p{x) = Ct^iix, Tt{x)). Combining this with 
Equahty (2), we conclude that the two extreme terms in the inequahty above are equaL 
Hence, for every x G -B„, we have 

^(T,(x)) - Ct-sATsi^)Mx)) = ^:{Tsix)) = MUx)) 

= MTs{x))=ij:{Ts{x)) = ^{x) + Cs,L{x,T,{x)). (5) 

In particular, we get 

Vx G B, i)s(Ts{x)) = ip{x) + Cs,l{x, Ts{x)), 

or equivalently 

i^siy) - = Cs,L{x,y) for 7s-a.e. {x,y). 

Since {(p,ips) is a (Borel) c^^i-subsolution, it follows that the pair {(p,ips) is {cg^Lyls)- 
calibrated. Therefore, by Theorem 12.31 we get that 7^ = (Id^ xTs)^/io is an optimal plan 
for (cs,L, /io, /^s)- Moreover, since Cs^l is 7s-integrable, the infimum in the Kantorovitch 
problem (jlj) in Theorem 13.21 with cost Cs^l is finite, and therefore there exists a unique 
optimal transport plan. This proves (i). 

Note for further reference that a similar argument, using the equality 

Vx G B, tp{Tt{x)) = ips(Ts{x)) + Ct-s,L{Ts{x),Tt{x)), 

which follows from Equation (5) above, shows that the measure % = (TsXTt)#fio is an 
optimal plan for the cost Ct-s,L and the pair of measures {fis,fJ't)- 

We now want to prove (ii). Since B is the increasing union of = 5 fl fl Tf^{V^), 
it suffices to prove that Tg is injective on i?„ and that the restriction T~^\T(Bn) is locally 
Lipschitz on Ts{Bn). 

Since i?„ C Vn, by Inequahty (3) above we have 

Vx G B^,yy G M, ip^iy) < ^^{y) < ^{x) + Cs,L{x,y). (6) 

Moreover, by Equality (5) above 

Vx G 5„, ip:{Ts{x)) = ^:{Ts{x)) = ip{x) + Cs,l{x,Ts{x)). (7) 

In particular, we have ip^ < ip^ everywhere with equality at every point of Ts{Bn)- As 
we have said above, both functions ip"^ and — are locally semi-concave with a linear 
modulus. It follows, from Theorem IA.191 that both derivatives lizt^", dzip'^ exist and 
are equal for z G Ts{Bn). Moreover, the map z ^-^ dzf^ = dzip"^ is locally Lipschitz 
on Ts{Bn)- Note that we also get from (6) and (7) above that for a fixed x G we 
have v?" < p{x) + Cs,l(x, ■) everywhere with equality at Ts(x). Since pn is semi-convex, 
using that Cs,l(x, ■) is semi-concave, again by Theorem IA.191 we obtain that the partial 
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9cg ij 

derivative ^' (x,Ts(x)) of Cg l with respect to the second variable exists and is equal 
dy 

to dT^(^x)'^^ = dT^{x)'ips- Since '■ [0,t] ^ M is an L-minimizer with 7^(0) = x and 
73; (s) = Ts{x), it follows from Corollary IB.20I that 

c?r4x)C = ^-^i.^^Ts{x)) = |^(7x(s),7x(s)). 

Since 7a; is an L-minimizer, its speed curve is an orbit of the Euler-Lagrange flow, and 
therefore 

(r,(x),rfT.(x)0 =^((7x(s),7x(s)) =^0f(7x(O),7.(O)), 

and 

a; = 7r0^,if-i(T,(x),dT.(x)O- 

It follows that Tg is injective on Bn with inverse given by the map 6'„ : Ts{Bn) 
defined, for z G Ts{Bn), by 

=7r</.^,if-i(z,40- 

Note that the map 9n is locally Lipschitz on Ts{Bn), since this is the case for z 1— >• dzipg, 
and both maps (p^^, =Sf ~^ are C^, since L is a Tonelli Lagrangian. An analogous argument 
proves the countably Lipshitz regularity of Tg = TtT~^ in part (iv). Finally the optimality 
of simply follows from 



min <^ / Cs^L{,x,y)d-i{x,y)\ = min <^ / Cs,Lix,y) d'y{x,y) 



M 



M 

Part (iii) of the Theorem follows from part (ii). In fact, if A C M is Lebesgue 
negligible, the image T~^{Ts{B) fl A) is also Lebesgue negligible, since is countably 
Lipschitz on Ts{B), and therefore sends Lebesgue negligible subsets to Lebesgue negligible 
subsets. It remains to note, using that B is of full /iQ-measure, that /is(A) = Ts^^o{A) = 
fj.o{T-\UB)nA)) = 0. 

To prove part (iv), we already know that % = {TsXTt)^fiQ is an optimal plan for the 
cost Ct-s,L and the measures Since the Borel set B is of full /iQ-measure, and 

Ts : B Ts{B) is a bijective Borel measurable map, we obtain that is a Borel map, 
and /io = T~J:Hs. It follows that 

% = ildMxTtT-')#fis. 
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Therefore the composition TtT~^ is an optimal transport map for the cost Ct-s,L and the 
pair of measures {fig, fit), and it is the unique one since Ct-s,L is 7s-integrable and /i^ is 
absolutely continuous with respect to the Lebesgue measure. □ 



Remark 5.2. We observe that, in proving the uniqueness statement in parts (i) and (iv) 
of the above theorem, we needed the full generality of Theorem 14.21 Indeed, assuming 



would have to be finite. So the existence and uniqueness of a transport map in Theorem 
13.21 under the integrability assumption on c with respect to fi® v instead of assumption 
(iv) would not have been enough to obtain Theorem 15. 1[ 

Remark 5.3. We remark that, if both /io and /if are not assumed to be absolutely 
continuous, and therefore no optimal transport map necessarily exists, one can still define 
an "optimal" interpolation (/is)o<s<t between /xq and fit using some measurable selection 
theorem (see [32') Chapter 7]). Then, adapting our proof, one still obtains that, for any 
s G (0,t), there exists a unique optimal transport map Ss for (c^^^, /i^, /xq) (resp. a unique 
optimal transport map 5*^ for {ct-s,L, fJ-s, fJ^t)), and this map is countably Lipschitz. 

We also observe that, if the manifold is compact, our proof shows that the above maps 
are globally Lipschitz (see [7j). 



A Semi-concave functions 

We give the definition of semi-concave function and we recall their principal properties. 
The main reference on semi-concave functions is the book [T3] . 
We first recall the definition of a modulus (of continuity). 

Definition A.l (Modulus). A modulus u is a continuous non- decreasing function u : 
[0, +oo) [0, +oo) such that uj{0) = 0. 

We will say that a modulus is linear if it is of the form uj{t) = kt, where > is some 
fixed constant. 




A/xM 



there is a priori no reason for which the two integrals 
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We will need the notion of superdifferential. We define it in an intrinsic way on a 
manifold. 

Definition A.2 (Superdifferential). Let / : M ^ R be a function. We say that p e T*M 

is a superdifferential of / at a; G M, and we write p G D~^f{x), if there exists a function 
g : V ^ defined on some open subset U G M containing x, such that g > f, 
g{x) = f{x), and g is different iable at x with dxg = p- 

We now give the definition of a semi-concave function on an open subset of a Euclidean 

space. 

Definition A. 3 (Semi-concavity). Let U C M"^ open. A function f : U ^ M is said to be 
semi-concave in U with modulus uj (equivalently cu-semi-concave) if, for each x & U, we 
have 

f{y) - f{x) < {k, y-x) + \\y- x\\u{\\y - x\\) 
for a certain linear form : R" — > R. 

Note that necessarily e D'^f{x). Moreover we say that f : U — > R is locally semi- 
concave if, for each x & U, there exists an open neighborhood of x in which / is semi- 
concave for a certain modulus. 

We will say that the function f : U ^ R is locally semi-concave with a linear modulus 
if, for each x & U, we can find an open neighborhood Vx such that the restriction f\v^ is 
a;-semi-concave, with cu a linear modulus. 

Proposition A.4. 1 ) Suppose : t/ ^ R, i = 1, . . . , /c is Ui-semi-concave, where U is an 
open subset of R". Then we have: 

(i) for any ai, . . . ,ak > 0, the functions Y^'^=i '^ifi (Si=i OLiUJi)-semi-concave on U . 

(ii) the function min^L^ fi is (max^L^^ a;^) -semi-concave. 
2) Any function is locally semi-concave. 

Proof. The proof of l)(i) is obvious. For the proof of (ii), we fix x G U, and we find 
io G {1, . . . ,k} such that min^^^ fi{x) — fio{x). Since /jg is a;j(,-semi-concave, we can find 
a linear map Zj. : R" — > R such that 

G U, fi^(y) - fi^(x) < lx(y -x) + \\y - x\\ujig(\\y - x\\). 

It clearly follows that 

k k 1^ 

Vy G U, min fi{y) -min/j(x) < kiy - x) + \\y - x\\ max LJi{\\y - x\\). 
1=1 1=1 1=1 

To prove 2), consider an open convex subset C with C compact and contained in U. 
By compactness of C and continuity of x i-^ dxf, we can find a modulus lu, which is a 
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modulus of continuity for the map x i— > d^f on C. The Mean Value Formula in integral 
form 

f{y) - f{x) = / dtx+{i-t)yf{y - x) dt, 

Jo 

which is valid for every y,x E C implies that 

Vx, y eU, f{y) - f{x) < d^f{y - x) + \\y - x\\u{\\y - x\\). 

Therefore / is a;-semi-concave in the open subset C. □ 

We now state and prove the first important consequences of the definition of semi- 
concavity. 

Lemma A. 5. Suppose U is an open subset of R". Let /:[/—> IR be an u; -semi-concave 
function. Then we have: 

(i) for every compact subset K C U, we can find a constant A such that for every 
X & K, and every hnear form on R" satisfying 

My e U, f{y) - f{x) < {k, y - x) + \\y - x\\uj{\\y -x\\), 

we have \\lx\\ < A; 

(a) the function f is locally Lipschitz. 

Proof. Prom the definition, it follows that a semi-concave function is locally bounded from 
above. We now show that / is also locally bounded from below. Fix a (compact) cube C 
contained in U and let {yi, . . . ,1/2"} be the vertices of the cube. Then, for each x & C, 
we can write x — ccjyj, with ai — l. By the semi-concavity of / we have, for each 
^ = 1,...,2-, 

fiVi) - fix) < {Ix, yi-x) + \\yi - x\\uj{\\yi - x\\); 
multiplying by a-i and summing over i, we get 

^ oiifiyi) < f{x) + ^ ai\\yi - x\\u;{\\yi - < f{x) + B, 

i i 

with B — Dcijj{Dc), where Dc is the diameter of the compact cube C. It follows that 

VxeC, f{x)>imuf{yi)-B. 

We now know that / is locally bounded. Using this fact, it is not difficult to show (i). In 
fact, suppose that the closed ball -B(a;o, 2r), r < -|-oo, is contained in U . For x e -B(a;o, r), 
we have x — rv & ]3{xQ,2r) cU for each v e R" with ||v|| = 1, and therefore 

f{x — rv) — f{x) < {1x1 —Tv) + ||— rt'||a;(||— r^ll) = —r{lx, v) + ruj{r). 
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Since, by the compactness of B{xQ,2r), we already know that B = sup^g^^^.^ 2r)l/('^)l 
finite, this imphes 

/, V fix) - fix -rv) , ^ 25 

{L, v) < ^ '-^ + uj{r) <—+ ujir). 

It follows that, for x G B{xo,r), 

< h uj[r). 

r 

Since the compact set K G U can be covered by a finite numbers of balls B{xi,ri), 
i = 1, . . . , i, we obtain (i). 

To prove (ii), we consider a compact subset K G U, and we apply (i) to obtain the 
constant A. We denote by the (finite) diameter of the compact set K. For each 
x,y e K, 

fiv) - fix) < {h,y-x) + \\y - x\\u{\\y - x\\) 

< {\m\+u{DK))\\y-x\\ 

< iA + ujiDK))\\y-x\\. 

Exchanging the role of x and y, we conclude that / is Lipschitz on K. □ 

Let us recall that a Lipschitz real valued function defined on an open subset of a Eu- 
clidean space is differentiable almost everywhere (with respect to the Lebesgue measure). 
Therefore by part (ii) of Lemma IA.5I above we obtain the following corollary: 

Corollary A. 6. A locally semi-concave real valued function defined on an open subset 
of a Euclidean space is differentiable almost everywhere with respect to the Lebesgue 
measure. 

In fact, in the case of semi-concave functions there is a better result which is given 
in Theorem IA.8I below, whose proof can be found in [131 Section 4.1]. Let us first give a 
definition: 

Definition A. 7. We say that 5 C M" is countably {n — 1)-Lipschitz if there exists a 
countable family of compact subsets Kj is a compact subset of such that: 

(i) E is contained in UjKj; 

(ii) for each j there exists a hyperplane Hj C = Hj(BHj-, where Hj- is the Euclidean 
orthogonal of Hj, such that Kj is contained in the graph of a Lipschitz function 
fj : Aj Hj- defined on a compact subset Aj C Hj. 
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Note that in the definition above, by the graph property (ii), the compact subset Kj 
has finite (n — l)-dimensional Hausdorff measure. Therefore any {n — 1)-Lipschitz set 
is contained in a Borel (in fact a-compact) {n — 1)-Lipschitz set with cr-finite {n — 1)- 
dimensional Hausdorff measure. 

Theorem A. 8. If if : U M is a semi-concme function defined on tie open subset 
U of M", tlien ip is differentiable at eaci point in the complement of a Borel countably 
[n — 1)-Lipschitz set. 

In order to extend the definition of locally semi-concave to functions defined on a 
manifold, it suffices to show that this definition is stable by composition with diffeomor- 
phisms. 

Lemma A. 9. Let U,V G MJ^ be open subsets. Suppose that F : V ^ U is a map. 
If f : U ^ M. is a locally semi-concave function then f o F : V — M is aiso locally 
semi-concave. Moreover, if F is of class C^, and M is a locally semi-concave 

function with a linear modulus then f o F : V is also locally semi-concave with a 
linear modulus. 

Proof. Since the nature of the result is local, without loss of generality we can assume that 
/ : [/ — s> M is semi-concave with modulus u. We now show that, for every V convex open 
subset whose closure V' is compact and contained in V, the restriction f o F\y' : V ^ R is 
a semi-concave function. We set Cy, = max^^y/\\DzF\\, and we denote by Uy, a modulus 
of continuity for the continuous function z i— > D^F on the compact subset V'. 
For each x, y in the compact convex subset V' C V, we have 

f{F{y)) - f{F{x)) < F{y) - F{x)) + \\F{y) - F{x)M\\F{y) - F{x)\\) 

< {lF(x),DF{x){y - x)) + \\lF(^^)\\ujy,{\\y - x\\)\\y - x\\ 
+ Cy,\\y - x\\uj{Cy,\\y - x\\); 

Since F{V') is a compact subset of U we can apply part (i) of Lemma IA.5I to obtain 
that Cy/ = supy,\\lp(^x)\\ is finite. This implies that / o F on is semi-concave with the 
modulus 

Oj{r) = Cy,UJy,{r) + Cy,Uj{Cy,r). 

If F is C^, then its derivative DF is locally Lipschitz on U, and we can assume that ujy/ 
is a linear modulus. Therefore, if a; is a linear modulus, we obtain that a) is also a linear 
modulus. □ 

Thanks to the previous lemma, we can define a locally semi-concave function (resp. 
a locally semi-concave function for a linear modulus) on a manifold as a function whose 
restrictions to charts is, when computed in coordinates, locally semi-concave (resp. lo- 
cally semi-concave for a linear modulus). Moreover, it suffices to check this locally semi- 
concavity in charts for a family of charts whose domains of definition cover the manifold. 
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It is not difficult to see that Theorem IA.8I is vahd on any (second countable) manifold, 
since we can cover such a manifold by the domains of definition of a countable family of 
charts. 

Now we want to introduce the notion of uniformly semi-concave family of functions. 

Definition A. 10. Let fi : U — >^ M, i e /, be a family of functions defined on an open 
subset U of M". We will say that the family {fi)i£i is uniformly uj- semi- concave, where 
a; is a modulus of continuity, if each fi is a;-semi-concave. We will say that the family 
ifi)iei is uniformly semi-concave if there exists a modulus of continuity u such that the 
family {fi)i^i is uniformly w-semi-concave. We will say that the family {fi)i(^i is uniformly 
semi-concave with a linear modulus, if it is uniformly cu-semi-concave, with uj of the form 
t I—*- kt, where is a fixed constant. 

Theorem A. 11. Suppose that /j : f/ — >■ M, z G /, is a family of functions defined on an 
open subset U of M". Suppose that this family {fiji^i is uniformly u -semi-concave, where 
uj is a modulus of continuity. If the function 

f{x) = mU,{x) 

is finite everywhere on U, then f : U ^ M. is also u- semi- concave. 

Proof. Fix xq e U. We can find a sequence in such that fi„{xQ) \ f{xQ) > — oo. We 
choose a cube C <ZU with center xq. Call yi, . . . , ?/2" the vertices of C. By the argument 
in the beginning of the proof of Lemma IA.5[ we have 

Vx eC, Wi e I, min fi{yj) < fi{xo) + DcUj{Dc), 

l<j<2" 

where Dc is the diameter of the compact cube C. Using the fact that f{yj) = infjg/ fi{yj) 
is finite, it follows that there exists A G M such that 

Vx e C, Vi G /, fi{x) > A. 

Choose now e > such that B{xq, e) G C. If /j : ^ M is a linear form such that 

e U, f^{y) < f^{xo) + {li,y- xo) + \\y - xo||t^(||y - xoW), 

we obtain that, for every G M" of norm 1, 

A < fi{xo) + {k, ev) + euj{e). 

Since fi„{xo) \ /(xq), we can assume fi„{xo) < M < +oo for all n, that implies 

n, n M-A 

\\kj\ < — \- < +00. 
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Up to extracting a subsequence, we can assume — >• I in M"*, the dual space of M". 
Then, as for every y e [/ we have f{y) < fi^{y), passing to the hmit in n in the inequahty 

f{y) < fini^o) + {ku^y-xo) + \\y - XQ\\uj{\\y - XqW), 

we get 

fiy) < fixo) + {l,y-X(i) + l|y-a;o||a;(|||/-a;o||). 
Since xq & U is arbitrary, this concludes the proof. □ 

Before generalizing the notion of uniformly semi-concave family of functions to mani- 
folds, let us look at the following example. 

Example A. 12. For /c e R, define : R — > M as fk{x) = kx. It is clear that the family 
{fk)keM. is a;-semi-concave for every modulus of continuity cu. In fact 

fkiy) - fkix) = k{y -x) < k{y - x) + \y - x\uj{\y - x\), 

since uj > 0. Consider now the diffcomorphism (/? : M^, ^p{x) = x^. Then there 

does not exist a non-empty open subset U C W^, and a modulus of continuity u, such 
that the family (/^ o (p\u)k£R is (uniformly) a;-semi-concave. Suppose in fact, by absurd, 
that 

fk o <p{y) - fk° < lx{y -x) + \y- x\uj{\y - x\), 

where Ix depends on k but not uj. Since fk o ip is differentiable we must have lx{y — x) — 
{fk o v)'{x){y — x) — 2kx{y — x). Therefore we should have 

ky^ — kx'^ < 2kx{y — x) + \y — x\cu{\y — x\). 

Fix x,y & U, with y ^ x and set h = y — x. Then 

kh^ < \h\uj{\h\) ^k< '^^^ Wk, 

that is obviously absurd. 

Therefore the following is the only reasonable definition for the notion of a uniformly 
locally semi-concave family of functions on a manifold. 

Definition A. 13. We will say that the family of functions /j : M ^ M, i G /, defined 
on the manifold M, is uniformly locally semi-concave (resp. with a linear modulus), if 
we can find a cover {Uj)j^j of M by open subsets, with each Uj domain of a chart 
iPj : Uj Vj C M" (where n is the dimension of M), such that for every j & J the 
family of functions (/j o ipj'^)i^j is a uniformly semi-concave family of functions on the 
open subset Vj of (resp. with a linear modulus). 
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The following corollary is an obvious consequence of Theorem lA.lli 

Corollary A. 14. If the family fi:M^M.,i&Iis uniformly locally semi-concave (resp. 
with a linear modulus) and the function 

f{x) = inf 

is finite everywhere, then / : M — R is locally semi-concave (resp. with a linear modulus). 

Definition A. 15. Suppose c:MxA^^Misa function defined on the product of 
the manifold M by the topological space N. We will say that the family of functions 
{c{-,y))y^iy is locally uniformly locally semi-concave (resp. with a linear modulus), if for 
each yQ E N we can find a neighborhood Vq of yo in N such that the family {c{-,y))y<^Vo 
is uniformly locally semi-concave on M (resp. with a linear modulus). 

Proposition A. 16. Suppose c : M x N —>■ is a function defined on the product of 
the manifold M by the topological space N, such that the family of functions (c(-, y))yeN 
is locally uniformly locally semi-concave (resp. with a linear modulus). If K (Z N is 
compact, and the function 

fxix) = inf c{x,y) 

is finite everywhere on U, then fx'-U ^ M is locally semi-concave on M (resp. with a 
linear modulus). 

Proof. By compactness of K, we can find a finite family V^, i = 1, . . . , £ of open subsets 
of such that K C Uf^j^Vi, and for every i = !,...,£, the family {c{-,y))y^Vi is locally 
uniformly locally semi-concave (resp. with a linear modulus). The function 

fi{x) = M c{x,y) 
yeKnVi 

is finite everywhere on U, because > fx- It follows from Corollary I A . 1 41 that /j is locally 
semi-concave on M (resp. with a linear modulus), for i = 1, . . . , £. Since fx = iaiT^i=i fh 
we can apply part (ii) of Proposition IA.4I to conclude that fx has the same property. □ 

Proposition A. 17. If c : M x N —>■ W is a locally semi-concave function (resp. with a 
linear modulus) on the product of the manifolds M and N, then the family of functions 
on M {c{-,y))y^]y is locally uniformly locally semi-concave (resp. with a linear modulus). 

Proof. We can cover M x by a family (f/j x Wj)i(zij(zj of open sets with f/j open in 
M, Wj open in A^, where Ui is the domain of a chart ipi : f/j — ^ Ui C M" (where n is the 
dimension of M), and Wj is the domain of a chart ipj • C M"^ (where m is the 

dimension of M) , and such that 

{x,y) ^ c{'^i\x),iljj\y)) 
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is j-semi-concave on f/j x Wj, for some modulus ujij. It is then clear that the family 

is uniformly locally tUjj- semi- concave on f/j. □ 

The following corollary is now an obvious consequence of Propositions lA. 17l and IATTgI 

Corollary A. 18. Suppose c : M x — >■ R is a locally semi-concave function (resp. with 
a linear modulus) on the product of the manifolds M and N. Let K be a compact subset 
of N. If the function 

fxix) = ini c{x,y) 

yeK 

is finite everywhere on U , then fx '■ U ^ M is locally uniformly locally semi-concave 
(resp. with a linear modulus). 

We end this section with another useful theorem. The proof we give is an adaptation 
of the proof of [201 Lemma 3.8, page 494]. 

Theorem A. 19. Let (pi,(p2 '■ M —* W be two functions, with (pi locally semi-convex 
(i.e. —ipi locally semi- concave) , and (f2 locally semi-concave. Assume that (fi < <f2- If we 
define £ = {x E M \ ipi{x) = (p2{x)}, then both ipi and ip2 are differentiable at each x & £ 
with dx^pi = at such a point. Moreover, the map x i— >• rf^V^i = dx'{)2 is continuous on 
£. 

If (pi is locally semi-convex and ip2 is locally semi-concave, both with a linear modulus, 
then, in fact, the map x i— > d^^pi = dx^P2 is locally Lipschitz on £. 

o 

Proof. Since the statement is local in nature, we will assume that M =B is the Euclidean 
unit ball of center in M", and that —(fi and (f2 are semi-concave with (common) modulus 
uj. Suppose now that x E £. We can find two linear maps li^x-, h,x : M" — M such that 

^i{y) > + h^cciy -x) - \\y - X\\cuc^^{\\y - X||euc) 

V2iy) < ^2ix) + l2,xiy -X) + \\y - X\\cuci^{\\y - Xlleuc)- 

Using ipi < ip2, and ipi{x) = (p2{x), we obtain 

h,x{y -X) - \\y - X\\enc^^{\\y - x\\cuc) < fl{y) - fl{x) < 

< Lp2{y) - >f2{x) < l2,xiy -x) + \\y - xlleuc^^dlz/ " a;||euc)- (1) 

In particular, we get 

h,xiy -x) - \\y - x||cuct^(||y - a;||euc) < kAy - + \\y - x\\cuc^^i\\y - X||cuc), 
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replacing y hy x + v with ||i'||euc small, we conclude 

h,x{'") - ll^'lleuct^(llt'lleuc) < k,x{'") + \\v\\ euc'^(ll'i^lleuc)- 

Therefore 

cue J ; 

for V small enough. Since l2,x — h,x is linear it must be identically 0. We set Ix = l2,x = h,x- 

o 

For i = 1,2 and y GB, we obtain from (1) 

\Vi{y) - - lx{y - X)\ < \\y - Xlleuc'^dly - a^lleuc)- (2) 

This implies that </?j is differentiable at a; e £, with d^ipi — I- It remains to show the 
continuity of the derivative. Fix r < 1. We now find a modulus of continuity of the 

o o 

derivative on the ball r B. If yi, y2 £ ^ H r B, and \\k\\euc ^ ^ — r, we can apply three 
times (2) to obtain 

^l{y2) - ^l{yi) - dy^(pi{y2 - yi) < \\y2 - |/l||eucCc;(|||/2 - 2/l||euc) 
<Pl{y2 + k)- (pi{y2) - dy^(pi{k) < ||A;||euc'^(||fc||euc) 

-^i{y2 + k) + (pi{yi) + dy^ipi{y2 + k-yi) < \\y2 + k- ?/i||euc^(||y2 + k- yi||euc)- 
If we add the first two inequality to the third one, we obtain 
[dy,(fii - dy^(pi\{k) < II 

y2 l/l II cuc'^' ( II 2/2 2/l||cuc) ~l~ II ^11 cue 

uj{\\k\\ 

euc J 

+ [\\y2-yi\\ euc + k euc]^(||Z/2 yijjeuc + ll^lleuc), 
which implies, exchanging k with —k, and using that the modulus cu is non-decreasing 

\[dy^ipi - dy^ipi\{k)\ < 2[\\y2 - ?/i||euc + ||A)||euc]'^(||y2 " |/l||euc + ||A:||euc)- 

Since \\y2 — ?/i||cuc < 2, we can apply the inequality (3) above with any k such that 
ll^llcuc = (1 —r)\\y2 — 2/i||euc/2. If we divide the inequality (3) by jj/cjjeuc) and take the sup 
over all k such that ||/c||euc = (1 ~ ^)\\y2 — yi||cuc/2, we obtain 

\\dyi^l - dy^(pi\\euc < 2 

It follows that a modulus of continuity of x i— > dxfi on £^ fl r B is given by 

6 — 2r 3 — r 

o 

This implies the continuity of the map x d^^i on £ fl r B- It also shows that it is 

o 

Lipschitz on £ n r B when a; is a linear modulus. □ 



+ 1 



((1 + )l|y2 -yilleuc)- 
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B Tonelli Lagrangians 



B.l Definition and background 

We recall some of the basic definition, and some of the results in Calculus of variations 
(in one variable). There are a lot of references on the subject. In [21], one can find an 
introduction to the subject that is particularly suited for our purpose. Other references 
are [TU] and the first chapters in • A brief and particularly nice description of the main 
results is contained in [Hj. 

Definition B.l (Lagrangian). If M is a manifold, a Lagrangian on M is a function 
L : TM —>■ M. In the following we will assume that L is at least bounded below and 
continuous. 

Definition B.2 (Action). If L is a Lagrangian on M, for an absolutely continuous curve 
7 : [a, b] — > M, a < b, we can define its action A/, (7) by 



Note that the integral is well defined with values in MU {+00}, because L is bounded 
below, and s L{'j{s),'j{s)) is defined a.e. and measurable. To make things simpler to 
write, we set = +00 if 7 is not absolutely continuous. 

Definition B.3 (Minimizer). If L is a Lagrangian on the manifold M, an absolutely 
continuous curve 7 : [a,b] M, with a < 6, is an L-minimizer, if A^l'-f) < Al{6) for 
every absolutely continuous curve 6 : [a,b] ^ M with the same endpoints, i.e. such that 
6{a) = 7(a) and S{b) = 7(&). 

Definition B.4 (Tonelli Lagrangian). We will say that L : TM ^ M is a weak Tonelli 
Lagrangian on M, if it satisfies the following hypotheses: 

(a) L is C^; 

(b) for each x G M, the map L{x, ■) : T^-M ^ M is strictly convex; 

(c) there exist a complete Riemannian metric g on M and a constant C > —00 such 



where ||-||^ is the norm on T^M obtained from the Riemannian metric g; 

(d) for every compact subset K G M the restriction of L to TkM = Uxi^kTxM is 
superlinear in the fibers of TM — >■ M: this means that for every A > 0, there exists 
a constant C{A,K) > —00 such that 




that 



W{x,v)eTM, L{x,v) >\\v\\x + C 



V(x, v) G TkM, L{x, v) > A\\v\\x + C{A, K). 
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We will say that L is a Tonelli Lagrangian, if it is a weak Tonelli Lagrangian, and satisfies 
the following two strengthening of conditions (a) and (b) above: 

(a') L is C^; 

(b') for every (x,v) G TM, the second partial derivative -—-(x,v) is positive definite 
on T^M. 

Since above a compact subset of a manifold all Riemannian metrics are equivalent, if 
condition (d) in the definition is satisfied for one particular Riemannian metric, then it is 
satisfied for any other Riemannian metric. 

Note that when L is a weak Tonelli Lagrangian on M, and t/ : M — > R is a function 
which is bounded below, then L + U, defined by (L + U){x, v) = L{x, v) + U (x) is a weak 
Tonelli Lagrangian. If moreover L is a Tonelli Lagrangian, and U is and bounded 
below, then L + U is a Tonelli Lagrangian. Therefore one can generate a lot of (weak) 
Tonelli Lagrangians from the following simple example. 

Example B.5. Suppose that is a complete smooth Riemannian metric on M, and 
r > 1. We define the Lagrangian L^^g on M by 

Lr,g{x,v) = \\v\\l = g^{v,Vy^^. 

1) L2^g is a Tonelli Lagrangian. 

2) For any r > 1, the Lagrangian is and is a weak Tonelli Lagrangian. 

In both cases, the Riemannian metric mentioned in condition (c) of Definition IB. 41 is the 
same metric g. 

Moreover, the vertical derivative of the Lagrangian Lr^g is given by 

'"iff/ \ II llr-2 / \ 

Proof. Since r > 1 it is not difficult to check that L has (in coordinates) partial derivatives 
everywhere with 

^(x,0)=0 and ^(x,0) = 0, 
ox ov 

and that these partial derivatives are continuous. Therefore L is C^. A simple computation 
gives 

We now prove condition (c) and (d) of Definition IB. 41 at once. In fact, if A is given, we 
have 

Lr,gix, V) = \\v\\l > A\\v\\^ - A^l'"'^, 
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as on can see by considering separately the two cases ||f ||^ ^ > A and ^ < A. The 
rest of the proof is easy. □ 

The completeness of the Riemannian metric in condition (c) of Definition IB.4I above 
is crucial to guarantee that a set of the form 

^ = {7 e C\[a, b],M)\ 7(a) G K, A^i-f) < 

where i^' is a compact subset in M, k is a finite constant, and a < 6, is compact in the C*^ 
topology. In fact, condition (c) implies that the curves in such a set JF have a (yf-length 
which is bounded independently of 7. Since K is compact (assuming M connected to 
simplify things) this implies that there exist xq G M and R < +00 such that all the 
curves in are contained in the closed ball B{xo, R) = {y & M \ d{x, y) < R}, where d is 
the distance associated to the Riemannian metric g. But such a ball B{xq, R) is compact 
since g is complete (Hopf-Rinow Theorem). From there, one obtains that the set JF is 
compact in the C° topology, see [101 Chapters 2 and 3]. 

The direct method in the Calculus of Variations, see [TOl Theorem 3.7, page 114] or 
[2T] for Tonelli Lagrangians, implies: 



Theorem B.6. Suppose L is a weak Tonelli Lagrangian on the connected manifold M. 
Then for every a, 6 G M, a < 6, and every x,y E M, there exists an absolutely continuous 
curve 7 : [a, 6] — > M which is an L-minimizer with 7(a) = x and 7(6) = y. 

In fact in flU[ Theorem 3.7, page 114], the existence of absolutely continuous minimiz- 
ers is valid under very general hypotheses on the Lagrangian L (the C^ hypothesis on L is 
much stronger than necessary). We now come to the problem of regularity of minimizers 
which uses the C^ hypothesis on L: 

Theorem B.7. If L is a weak Tonelli Lagrangian, then every minimizer 7 : [a, 6] — > M 
is C^. Moreover, on every interval [to, ti] contained in a domain of a chart, it satisfies the 
following equality written in the coordinate system 

f)T f)T rh f)T 

— (7(ti),7(ti))-^(7(to),7(to)) = -g^{l{s)Ms))ds, (lEL) 

which is an integrated from of the Euler-lagrange equation. This implies that dL / dv{'y{t) , 'y{t)) 
is a function of t with 



d_ 

dt 



|^(7W,7W)- 



Moreover, if L is a C' Tonelli Lagrangian, with r > 2, then any minimizer is of class 

cr. 
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Proof. We will only sketch the proof. If L is a Tonelli Lagrangian, this theorem would 
be a formulation of what is nowadays called Tonelli's existence and regularity theory. In 
that case its proof can be found in many places, for example [lO], [Ej, or ^2]j|. The fact 
that the regularity of minimizers holds for (or even less smooth) Lagrangians is more 
recent. The fact that a minimizer is Lipschitz has been established by Clarke and Vinter, 
see [ini Corollary 1, page 77, and Corollary 3.1, page 90] (again the hypothesis L is C^ is 
stronger than the one required in this last work). The same fact under weaker regularity 
assumptions on L has been proved in [3]. A short and elegant proof of the fact that a 
minimizer for the class of absolutely continuous curves is necessarily Lipschitz has been 
given by Clarke, see p!5] . The key idea of why 7 should be bounded is that the Energy is 
conserved along absolutely continuous minimizers (this is easy to prove for C^ minimizers, 
see Corollary IB. ISp and the sublevels of the Energy are compact (see Proposition IB. 16]) . 

Once one knows that 7 is Lipschitz, when L is C^ it is possible to differentiate the 
action, see pXlj, ^14j, or [21], and, using an integration by parts, one can show that 7 
satisfies the following integrated form (lEL') of the Euler-Lagrange equation for almost 
every t G [to^^i]) for some fixed linear form c: 

— (7(t),7(t)) = c + —{^{s)Ms))ds. (lEL') 

But the continuity of the right hand side in (lEL') implies that dL / dv{'y{t) , jit)) extends 
continuously everywhere on [to,^i]- Conditions (a) and (b) on L imply that the global 
Legendre transform 

if : TM T*M, 
{x,v) ^ (x, ^{x,v)), 

is continuous and injective, therefore a homeomorphism on its image by, for example, 
Brouwer's Theorem on the Invariance of Domain (see also Proposition IB. 91 below). We 
therefore conclude that 7(t) has a continuous extension to [to, ^i]- Since 7 is Lipschitz this 
implies that 7 is C^. Equation (lEL) follows from (lEL'), which now holds everywhere by 
continuity. □ 

In fact, in this paper, we will only use the cases when L is C^, in which case this 
regularity of minimizers will follow from the "usual" Tonelli regularity theory, or when L 
is of the form L{x,v) = \\v\\^, p > 1, where the norm is obtained from a C^ Riemannian 
metric, in which case the minimizers are necessarily geodesies which are of course as 
smooth as the Riemannian metric, see Proposition IB. 24] below. 

To obtain further properties it is necessary to introduce the global Legendre transform. 
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Definition B.8 (Global Legendre Transform). If L is a Lagrangian on the manifold 

L, its global Legendre transform =Sf : TM —>■ T*M, where T*M is the cotangent bundle 
of M, is defined by 

dL 

^{x,v) = {x, —{x,v)). 

Proposition B.9. If L is a weak Tonelli Lagrangian on the manifold M, then its global 
Legendre transform ^ : TM T*M is a homeomorphism from TM onto T*M. 
Moreover, if L is a C Tonelli Lagrangian with r >2, then ^ is C"^. 

Proof. We first prove the surjectivity of Suppose p G T*M. By condition (d) in 
Definition IB.41 we have 

p{v) - L{x, v) < p{v) - {\\p\l + l)\\v\l-C{\\p\l + 1, {x}) 
<-\\v\l-C{\\p\l + l,{x}). 

But this last quantity tends to — oo, as ||f +00. Therefore the continuous function 
V ^ p{v) — L{x, v) achieves a maximum at some point Vp G T^M . Since this function is C^, 
its derivative at Vp must be 0. This yields p — dL/dv{x,Vp) = 0. Hence {x,p) = =Sf(x, fp). 

To prove injectivity of =Sf , it suffices to show that for v, v' G T^M, with v 7^ v', we have 
dL/dv{x, v) 7^ dL/dv{x, v'). Consider the function if : [0,1] ^ M.,t L{x,tv + {1 — t)v'), 
which by condition (b) of Definition IB .41 is strictly convex. Since it is C^, we must have 
(y9'(0) 7^ V''(l)- In fact, if that was not the case, then the non-decreasing function ^p' would 
be constant on [0,1], and ip would be affine on [0,1]. This contradicts strict convexity. 
By a simple computation, we therefore get 

— {x, v'){v - v') = ^'(0) ^ ^'(1) = —{x, v){v - v'). 

This implies dL/dv{x,v') 7^ dL/dv{x,v). We now show that =Sf is a homeomorphism. 
Since this map is continuous, and bijective, we have to check that it is proper, i.e. inverse 
images under of compact subsets of T*M are (relatively) compact. For this it suffices 
to show that for every compact subset K C M, and every C < +00, the set 

dL 

{{x,v)eTM\xeK, \\ — {x,v)\U<C} 

is compact. By convexity of f (—> L{x,v), we obtain 

dL 

— (x, v){v) > L{x, v) - L{x, 0). 



But \\dL/dv{x,v)\\x > dL/dv{x,v){v/\\v\\x), therefore by condition (d) of Definition IB. 4 
we conclude that 

VA > 0,^{x,v) G TkM, \\-^{x,v)\\x >A- [C{K,A)/\\vU 
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Taking A = C + 1, we get the inclusion 

(9/ 

{{x,v) eTM\xeK, \\ — {x,v)\\^ <C}C {{x,v) eTM\xeK, \\v\\^ < C{K,C + 1)}, 

ov 

and the compactness of the first set follows. 

Suppose now that L is a C" Tonelli Lagrangian with r > 2. Obviously =Sf is C^'^^. 
By the inverse function theorem, to show that it is a C"^ diffeomorphism, it suffices to 
show that the derivative is invertible at each point of TM. But a simple computation in 
coordinates show that the derivative of =Sf at {x, v) is given in matrix form by 

Id 

-—{x,v) 



, dxdv dv"^ 

This is clearly invertible by (b') of Definition IB.4[ □ 

Definition B.IO. If L is a Lagrangian on M, we define its Hamiltonian H : T*M — >■ 
MU {+CX)} by 

H{x,p) = sup p{v) — L{x,v). 

Proposition B.ll. Let L be a weak Tonelli Lagrangian on the manifold M. Its Hamil- 
tonian H is everywhere finite valued and satisfies the following properties: 

(a*) H is C^, and in coordinates 

f) M 

— mx,v)) = v 
^mx,v)) = -^{x,v). 

(b*) for each x G M, the map H{x, ■) : T*M —>■ M is strictly convex; 

(d*) for every compact subset K d M the restriction of H to T^M = Ux£kT*M is 
superlinear in the fibers of T*M — > M: this means that for every A > 0, there 
exists a finite constant C*{A, K) such that 

y{x,p) G T^M, H{x,p) > A\\p\\., + C*{A,K). 

In particular, the function H is a proper map, i.e. inverse images under H of compact 
subsets of M are compact. 

If L is a C Tonelli Lagrangian with r > 2, then 
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(a'*) H is C^■ 

(b'*) for every (x, v) G M, the second partial derivative (x,p) is positive definite on 
T*M. 

Proof. To show differentiabihty, using a chart in M, we can assume that M = f/ is an 
open subset in M™. Moreover, since all Riemannian metrics are equivalent above compact 
subsets, replacing U by an open subset V with compact closure contained in [/, we can 
assume that the norm used in (c) of Definition IB.4I is the constant standard Euclidean 
norm ||-||cuc on the second factor of TV = V ^ M™, that is 

Vx G V, Vt; G M™, L{x,v) > A\\v\\,^, + C{A), 

where C{A) is a finite constant, and sup^g^ L(a;, 0) < C < +00. 

We have T*V = V x M™*, where W^* is the dual space of M*". We will denote by |H|euc 
also the dual norm on M™* obtained from ||-||euc on M™. We now fix i? > 0. If p G M™* 
satisfies ||»|L„f. < R, we have 



p{v) - L{X,V) < Ibllcucll^^llcuc - {R+l)\\v\\euc " C {R + 1) 

< -\\v\Uuc-CiR+l). 
Since L{x, 0) < C for x G V, it follows that, for \\v\\cnc > C — C{R + '^), 

p{v) - L{x, v) <-C < -L{x, 0). 

This implies 

H{x,p) = sup p{v) — L{x,v) = sup p{v) — L{x,v), 

t'GK'" \\v\\cuc<C~C{R+l) 

Therefore the sup in the definition of H{x,p) is attained at a point f(x,p) with ||t'(x,p)||cuc < 
C — C{R+ 1). Note that this point V(x,p) is unique (compare with the argument proving 
that the Legendre transform is surjective). In fact, at its maximum f the function 
V H-> p{v) — L{x,v) must have derivative, and therefore 

dL 

This means {x,p) = ^{x,V(x,p)), but the Legendre transform is injective by Proposition 

MM 

Note, furthermore, that the map 

/ : X {\\p\U < R}) X {||t;||euc <C-C{R+ 1)} ^ R, 
{{x,p),v) i^p{v) - L{x,v), 

is C^. Therefore we obtain that H is from the following classical lemma whose proof 
is left to the reader. 
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Lemma B.12. Let f : N x K —>■ M., {x, k) ^ f{x, k) be a continuous map, where N is 
a manifold, and K is a compact space. Define F : N ^ M by F{x) = sup^g^ /(x, /c). 
Suppose that: 

df 

(i) exists everywhere and is continuous as a function of both variables {x, k); 

(a) for every x ^ N, the set {k E K \ f{x,k) = F{x)} is reduced to a single point, 
which we will denote by k^. 

Then F is C^, and the derivative D^F of F at x is given by 

df 

DxF = kx)- 

Returning to the proof of Proposition IB.IH by the last statement of the above lemma 
we also obtain 

— {x,p) = v^x,p) and —{x,p) = - — {x,v^x,p)) 

Since {x,p) = J!f{x,V(x,p)), this can be rewritten as 

-^—o^(x,v)=v and o J^(x,v) = —-—(x,v), (5) 

op ox ox 

which proves (a*). Note that when L is a C" Tonelli Lagrangian, by Proposition IB.9I the 
Legendre transform ^ is a global diffeomorphism. From the expression of the partial 
derivatives above, we conclude that OH /dp and dH/dx are both C^~^. This proves (a'*). 
We now prove (b'*). Taking the derivative in v of the first equality in ([5]) 



dH 

dp 



dL 



we obtain the matrix equation 



d'^H d'^L 

where the dot ■ represents the usual product of matrices. This means that the matrix 
representative of d"^!! / dp^{x,p) is the inverse of the matrix of a positive definite quadratic 
form, therefore d"^ H / dp^ {x , p) is itself positive definite. 

We prove (b*). Suppose pi ^ p2 are both in T*M. Fix t g]0, 1[, and set ps = 
tpi + (1 — t)p2. The covectors Pi,P2,P3 are all distinct. Call Vi,V2,V3 elements in T^M 
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such that Pi = dL/dv{x, Vi). By injectivity of the Legendre transform, the tangent vectors 
■^1) ^2? "^^3 are also all distinct. Moreover, for i = 1, 2 we have 



H{x,pi) = pi{vi) - L{x,Vij, 

H{x,p3) = ps{vs) - L{x,V3) = t[pi{vs) - L{x,V3)] + (1 -t)[p2{v3) - L{x,Vs)]. 

Since the sup in the definition of H{x,p) is attained at a unique point, and vi,V2,V3 are 
all distinct, for z = 1, 2 we must have 

Pi{v3) - L{x,V3) < Pi{Vi) - L{x,Vi) = H{x,pi). 

It follows that 

H{x,tp, + (1 -t)p2) < tH{x,pi) + (1 - t)H{x,p2). 
It remains to prove (d*). Fix a compact set K in M. Since 

H{x,p) > p{v) — L{x,v), 

we obtain 

H{x,p)> sup p{v) + inf — 

But C*{A,K) = mix^K,\\v\\:,<A —L{x,v) is finite by compactness, and sup|j^|| <aP(^) = 
A\\p\U. ' " □ 

Since for a weak Tonelli Lagrangian L, the Hamiltonian H : T*M — ^ M is C^, we can 
define the Hamiltonian vector field Xh on T*M. This is rather standard and uses the fact 
that the exterior derivative of the Liouville form on M defines a symplectic form on M, 
see [T] or [2H]- The vector field is entirely characterized by the fact that in coordinates 
obtained from a chart in M, it is given by 

Xh{x,p) = 

So the associated ODE is given by 

P=~^{x,p). 

In this form, it is an easy exercise to check that H is constant on any solution of Xh- 

We know come to the simple and important connection between minimizers and solu- 
tions of Xff. 
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Theorem B.13. Suppose L is a weak Tonelli Lagrangian on M. If '-f : [a,b] ^ M is a 
minimizer for L, then the Legendre transform of its speed curve t ^ =Sf (7(t), 7(t)) is a 
solution of the Haniiltonian vector field Xh obtained from the Hamiltonian H associated 
to L. 

Moreover, if L is a Tonelli Lagrangian, there exists a (partial) flow 0^ on TM such 
that every speed curve of an L-minimizer is a part of an orbit of (p^. This flow is called 
the Euler- Lagrange flow, is defined by 

where 0^ is the partial flow of the vector filed Xh- 
Proof. If we write {x{t),p{t)) = ^(7(t), 7(t)) then 

dL 

x{t)=^{t) and p(t) = — (7(t),7(t)). 

By Theorem IB. 71 x{t) = 7(t) is with x{t) = 7(t). The fact that p{t) is follows again 
from Theorem IB. 71 which also yields in local coordinates 

dL 

PW = ^(7(t),7W)- 

Since {x(t),p(t)) = ^{'y(t),j(t)), we conclude from Proposition IB . 1 1 1 that t {x(t),p(t)) 
satisfies the ODE 

P=-^{x,p). 

Therefore the Legendre transform of the speed curve of a minimizer is a solution of the 
Hamiltonian vector field Xh- 

If L is a Tonelli Lagrangian, by Proposition IB. 1 11 the Hamiltonian if is C^. Therefore 
the vector field Xh is C^, and it defines a (partial) fiow (p^ . The rest follows from 
what was obtained above and the fact that the Legendre transform is C^. □ 

We recall the following definition 

Definition B.14 (Energy). If L is a Lagrangian on the manifold M, its energy E : 
TM ^ R is defined by 

dL 

E(x, v) = H o ^(x, v) = T^(x, v)(v) — L(x, v)- 

ov 
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Corollary B.15. [Conservation of Energyjif L is a Lagrangian on the manifold M, 
and 7 : [a, 6] ^ M is a minimizer for L, then the energy E is constant on the speed 
curve 

s ^ (7(s),7(s)). 

Proof. In fact -E(7(s), 7(5)) = if o ^(7(5), 7(5)). But s 1-^ ^(7(5), 7(5)) is a solution of 
the vector field H, and the Hamiltonian H is constant on orbits of Xh- □ 

Proposition B.16. If L is a weak ToneUi Lagrangian on the manifold M, then for every 
compact subset K C M, and every C < +00, the set 

{{x, v) e TM \xeK, E{x, v) < C} 

is compact, i.e. the map E : TM M is proper on every subset of the form 7r~^{K), 
where K is a compact subset of M. 

Proof. Since E — H o this follows from the fact that H is proper and =Sf is a homeo- 
morphism. □ 

Proposition B.17. Let L be a weak ToneUi Lagrangian on M. Suppose K is a compact 
subset of M, and t > 0. Then we can find a compact subset K G M and a finite constant 
A, such that every minimizer 7 : [0,t] — >• M with 7(0), 7(t) G K satisfies 7([0,t]) C K 
and ||7(s)||7(s) < A for every s G [0, t]. 

Proof. We will use as a distance d, the one coming from the complete Ricmannian metric. 
All finite closed balls in this distance are compact (Hopf-Rinow theorem). We choose 
Xq e K, and it! such that K C B{xo,R) (we could take R — diam(ii'), the diameter of 
K). We now pick x,y e K. li a : [0, t] — > M is a geodesic with a{0) — x, a{t) — y and 
whose length is d{x,y) (such a geodesic exists by completeness), the inequality 

d{x, y) < d{x, Xo) + d{xo, y) < 2R 

implies that Q;([0,t]) C B{xo,3R). Moreover ||a(s)||„(s) = d{x,y)/t < 2R/t for every 
s e [0, t] . By compactness, the Lagrangian L is bounded on the set 

^ = {{z, v) e TM I z e B{xo, 3R), \\v\\^ < 2R/t}. 

We call 6 an upper bound of L on J^. Obviously the action of a on [0, t] is less than t9, and 
therefore if 7 : [0, t] — >• M is a minimizer with 7(0), 7(t) G K, we get L(7(s), 7(5)) ds < 
to. Using condition (c) on the Lagrangian L and what we obtained above, we see that 

Ct+ I \\i{s)\\^^s)ds<te. 
Jo 
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It follows that we can find Sq £ [0, t] such that 

||7(^o)IU.o)<^-^- 

Moreover 

^{[0,t]) C B{^{0),t{e - C)) C B{xo,R + t{e - C)). 
We set K = B(xo, R + t(e- C)). If we define 

9i = sup{E{z,v) I {z,v) e TM, zeK, \\v\\^ <0-C}, 

we see that 9i is finite by compactness. Moreover £'(7(so), 7(so)) ^ ^i- But, as mentioned 
earlier, the energy £^(7(s), 7(s)) is constant on the curve. This implies that the speed curve 

s ^ (7(5), 7(s)) 

is contained in the compact set 

^ = {{z,v) eTM\zeK, E(z,v) < 61}. 
Observing that the set ^ does not depend on 7, this finishes the proof. □ 

B.2 Lagrangian costs and semi-concavity 

Definition B.18 (Costs for a Lagrangian). Suppose L : TM — M is a Lagrangian on 
the connected manifold M, which is bounded from below. For t > 0, we define the cost 
Ct,L : M X M ^ R by 

CtA^^y) = inf Ai(7) 



where the infimum is taken over all the absolutely continuous curves 7 : [0, t] — >■ M, with 

It 



7(0) = X, and 7(t) = y, and Al^-j) is the action ^(7(5), 7(5)) ds of 7. 



Using a change of variable in the integral defining the action, it is not difficult to 
see that Ct,L = Ci,l* where the Lagrangian L* on M is defined by L^{x,v) = tL{x,t~^v). 
Observe that L* is a (weak) Tonelli Lagrangian if L is. 

Theorem B.19. Suppose that L : TM — R is a weak Tonelli Lagrangian. Then, for 
every t > 0, the cost q^l is locally semi-concave on M x M. Moreover, if the derivative 
of L is locally Lipschitz, then ct,L is locally semi-concave with a linear modulus. 

In particular, if L is a Tonelli Lagrangian for every t > 0, the cost Ct,L is locally 
semi-concave on M x M with a linear modulus. 
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Proof. By the remark preceding the statement of the theorem, it suffices to prove this for 
c = Ci^L- Let n be the dimension of M. Choose two charts ipi : f/j M", i = 0, 1, on M. 
We will show that 

o o 

is semi-concave on B x B, where B is the closed Euclidean unit ball of center in R". By 
Proposition IB. 17l we can find a constant A such that for every minimizer 7 : [0, 1] — > M, 
with 7(i) G Lp~'^(M), we have 

VsG [0,1], l|7(s)l|^(.) < A 
We now pick 6 > such that for all zi, Z2 G M", with ||2;i||cuc < 1, ||-22||cuc = 2, 

rf(</.-i(2;i),^-i(^2))>5, « = 0,1, 

where |H|euc denote the Euclidean norm. Then we choose £ > such that Ae < 5. It 
follows that 

7([0,£]) C ^0 '(2 B) and 7([1 - e, 1]) C ip-,\2 b). 

We set Xi = (pi{^{i)), 2 = 0,1. For ho, hi G M" we can define 7^0 : [0,£:] ^ M" and 
7^, : [1 - £, 1] ^ as 

6 — S 

IhM = ^~ ^^~^^ /ii + y^i(7(g)), l-£<s<l. 

We observe that when Hq = (or hi = 0) the curve coincide with 7. Moreover 7/io(0) = 
5^0 + ^0, 7/ii(l) = 5;i + /ii- We suppose that ||/ii||cuc < 2. In that case the images of both 

o 

7/tQ and 'jho are contained in 4 B and 

||7/i.(-5)||euc < \\hi\\cuc + WiVi ° 7)'('5)||euc < 2 + \\{Lpi O 7)'(s)||cuc- 

Since we know that the speed of 7 is bounded in M, we can find a constant Ai such that 

VsG[0,e], \\%Xs)\\cuc < Ai, 

VSG [1-6,1], \\^hAs)\\enc < Ai. 

To simplify a little bit the notation, we define the Lagrangian Lj : R" x M" — > R by 

L,iz,v) = Li^7\z),D[cp7']iv)). 
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If we concatenate the three curves <^o ^ ° 7/io' 7l[£,i-e] '^^'^ fi^° Ihi, we obtain a curve in 
M between (Pq^{xq + hg) and ip]^^{xi + hi), and therefore 



Jo 



Hence 

c {(fio'^ixo + ho), ipi^{xi + hi)) - c ((^0 ^(xo), <^r^(^i)) 

< / [Lo(%,(t),%,(t))-Lo('Poo^{t),{ipoojy(t))]dt 
Jo 

+ I [Li{%,{t),%M - Li{vi°l{t).{Vi°l)'{t))\ dt. 

Jl-e 

We now call u a common modulus of continuity for the derivative DLq and DLi on the 
compact set -6(0, 4) x 5(0, Ai). Here DLq and DLi denote the total derivatives of Lq and 
Li, i.e. with respect to all variables. When L has a derivative which is locally Lipschitz, 
then DLq and DLi are also locally Lipschitz on x R", and the modulus u; can be taken 

o 

linear. Since %i(s) EB (0,4) and ||7?ij(s)|| < Ai, we get the estimate 



c{ipo ^{xo + ho),iPi ^{xi + hi)) - c(<^o ^(^o), ^i ^(xi)) 



ho, — ho I dt 

e e 



< f DLo(^oo^(t),(^oo^y(t)) 
Jo 

+ ^ DLi {ipi o 7(i), {^i o ^)'{t)) (^-^li^-^hi, hi^ dt 

+ LlI ( -||/io||euc J -ll^olleuc + 1^ ( " ||euc J -||/il||euc- 



We observe that the sum of the first two terms in the right hand side is hnear, while the 
sum of the last two is bounded by 

^UJ Q||(/io,^l)||euc^ 11(^0, ^l)||euc- 

Therefore we obtain that 

{xo, xi) ^ c {(Pq^{xo), <pT^{xi)) 
is semi-concave for the modulus u;{r) = ^uj (^r) on B x B, as wanted. □ 
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Corollary B.20. If L is a weak Tonelli Lagrangian on the connected manifold M, then, 
for every t > 0, a superdifferential of Ct^L^x, y) at {xq, i/q) is given by 

dL dL 

{wo,w,) H-> —{^{t),i{t)){w,) - — (7(0),7(0))K), 

where : [0,t] M is a minimizer for L with 7(0) = Xq, 7(t) = Uq, and [wq^Wi) G 
T^M X TyM = T(^,y)(M X M). 

Proof. Again we will do it only for t = 1. If we use the notation introduced in the previous 
proof, we see that a superdifferential of 

(Xo, Xi) C ((^0 ^(xo), ^i\xi)) 

is given by 

{ho, hi) t-> Zo(/io) + k{hi), 

where 



k{ho 



'-t-edLo 







e dx 



+ 



I dip 
e dv 



(v9o0 7(t),((^o0 7)'W)(M dt, (6) 



h{hi) 



l-e 



t-{l-e) dLi 
e dx 



{^io^{t),{ipio^y{t)){hi) 



1 dL 

+ -^{Vi°lit).{Vi^l)'it)){h{) 



dt. 



By Theorem IB.71 the curve t ^ ipo o '^(t) is a extremal of Lq and it satisfies the 
Euler-Lagrange equation 

(V'o o lit), i^o o l)'{t)) ^ (ifo o 7(t), (y?o o l)'{t)) . 
Using this identity in ([6]) an integrating by parts, we get 



lo{ho 



dv 



(V?o0 7(0),(v^o0 7)'(0))(/io)- 



This means that Iq, reinterpreted on T^^^M rather than on M", is given by — ^ (7(0); 7(0))- 
The treatment for li is the same. □ 

We have avoided the first variation formula in the proof of Corollary IB.20| because 
this is usually proven for variation of curves and Lagrangians. Of course, our 
argument to prove this Corollary is basically a proof for the first variation formula for 
Lagrangians. This is of course already known and the proof is the standard one. 
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B.3 The twist condition for costs obtained from Lagrangians 

Lemma B.21. Let L be a weak Tonelli Lagrangian on the connected manifold M. Sup- 
pose that L satisfies the following condition: 

(UC) If 7j : [aj,6j] —>■ M,i = 1,2 are two L-minimizers such that 7i(to) = 72(^0) aiid 
7i(^o) = 72(^0); for some to G [ai,6i] fl [02, ^2], then 71 = 72 on the whole interval 
[ai, 61] n [02, 62]- 

Then, for every t > 0, the cost ct^i : M x M — ^ M satisfies the left (and the right) twist 
condition of Definition \2.4[ 

Moreover, if {x,y) G V{A[^^), then we have: 

(i) there is a unique L-minimizer 7 : [0, t] — ^ M such that x = 7(0), and y = 7(t); 

(ii) the speed 7(0) is uniquely determined by the equality 

Proof. We first prove part (ii). Pick 7 : [0, t] — M an L-minimizer with x = 7(0) and 
y = 7(t). From Corollary IB. 201 we obtain the equality 

^(^'?/) = -|^(^'7(0)). (*) 

Since the map v 1-^ L{x,v) is strictly convex, the Legendre transform v G T^M H-i> 
dL/dv{x,v) is injective, and therefore 7(0) G T^M is indeed uniquely determined by 
Equation (*) above. This proves (ii). 

To prove statement (i), consider another L-minimizer 71 : [0, t] — > M is x = 71 (0). By 
what we just said, we also have 

%^(x,y) = -§(x,7i(0)). 

By the uniqueness already proved in statement (ii), we get 71 (0) = 7(0)- It now follows 
from condition (UC) that 7 = 71 on the whole interval [0,t]. 

The twist condition follows easily. Consider {x,y), {x,yi) G V{A[^^) such that 

-^{x,y) = ^ix,y,),. (**) 

By (i) there is a unique L-minimizer 7 : [0,t] — > M (resp. 71 : [0,t] — * M) such that 
X = 7(0), y = 7(1) (resp. X = 7i(0), ?/i = 7i(l)), and 

%^(a;,y) = -|;^(a;,7(0)) and ^^(x, yi) = -^(x, 7i(0)). 
ox ov ox ov 
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^From equation (**), and the injectivity of the Legendre transform of L, it follows that 
7i(0) = 7(0). From condition (UC) we get 7 = 71 on the whole interval [0, t]. In particular, 
we obtain y = 'y(t) = 71 (t) = yi- □ 

The next lemma is an easy consequence of Lemma [B . 2 1 1 above . 

Lemma B.22. Let L be a weak Tonelli Lagrangian on M. If we can find a continuous 
local flow (pt defined on TM such tliat: 

(UC) for every L-niininiizer 'J : [a,b] M, and every ti,t2 G [a,b], the point (f)t2-ti{liti),j{tij) 
is defined and (7(^2), 7(^2)) = (pt2^ti{l{ti),iih)), 

then L satisfies (UC). Therefore, for every t > 0, the cost ct^L : M x M —>-M. satisfies the 
left twist (and the right ) condition of Definition \2.4[ 

Moreover, if {x, y) G ^^(A[.^ ^), then y = 7r0j(x, v), where n : TM — ^ M is the canonical 
projection, and v G T^M is uniquely determined by the equation 

ox ov 

The curve s G [0,t] t-^ 7r0s(x,f) is the unique L-niinimizer 7 : [0,t] ^ M with 7(0) = 
a;,7(l) = y. 

Note that the following proposition is contained in Theorem IB.13I 

Proposition B.23. If L is a Tonelli Lagrangian, then it satisfies condition (UC) for the 
Euler Lagrange Row 0f . 

Proposition B.24. Suppose g is a complete Riemannian metric on the connected man- 
ifold M, and r > 1. For a given t > 0, the cost ct^Lr gOf the weak Tonelli Lagrangian L^^g, 
defined by 

Lr,gix,v) = \\v\\l= g^iv^vY^"^, 



is given by 



9^ 

where dg is the distance defined by the Riemannian metric. The Lagrangian Lr^g satisfies 
condition (UC) of Lemma [B.22\ for the geodesic flow 0f of g. Therefore its cost Ct,Lr,g 
satisfies the left (and the right) twist condition. Moreover, if {x,y) G P(A|,^^ ), then 
y = 7T(f)f{x, v), where tt : TM — * M is the canonical projection, and v G T^M is uniquely 
determined by the equation 
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Proof. Define sbyl/s + l/r = l. Let 7 : [a, 6] ^ M be a piecewise curve. Denoting 
by igi'j) the Riemannian length of 7, we can apply Holder inequality to obtain 

''h{s)\\.ds<{b-ay/^(^j'Ms)\\:ds^ ' 

with equality if and only if 7 is parameterized with ||7(s)||^ constant, i.e. proportionally 
to arc-length. This of course implies 



with equality if and only if 7 is parameterized proportionally to arc-length. Since any 
curve can be reparametrized proportionally to arc- length and r/s = r — 1, we conclude 
that 

and that an Lr^^-minimizing curve has to minimize the length between its end-points. 
Therefore any L^^g-minimizing curve is a geodesic and its speed curve is an orbit of the 
geodesic fiow Therefore Lr^g satisfies condition (UC) of Lemma iB. 221 for the geodesic 
fiow (f)f of g. The rest of the proposition follows from Lemma [B. 221 □ 



References 

[1] R. Abraham & J.E. Marsden: Foundations of mechanics. Second edition, re- 
vised and enlarged. (1978) Benjamin/ Cummings Publishing Co. Inc. Advanced Book 
Program, Reading, Mass. 

[2] L. AmbrosiO: Lecture notes on optimal transport problems, in Mathematical As- 
pects of Evolving Interfaces, Lecture Notes in Math., 1812, (2003) Springer- Verlag, 
Berlin/New York, 1-52. 

[3] L. Ambrosio, O. Ascenzi & G. Buttazzo: Lipschitz regularity for minimizers 
of integral functional with highly discontinuous integrands. J. Math. Anal. AppL, 
142 (1989), no. 2, 301-316. 

[4] L. Ambrosio, N. Gigli & G. Savare: Gradient flows in metric spaces and in the 
Wasserstein space of probability measures. Lectures in Mathematics, ETH Zurich, 
(2005) Birkhauser. 

[5] L. Ambrosio & A. Pratelli: Existence and stability results in the theory of 
optimal transportation, in Optimal Transportation and Applications. Lecture Notes 
in Math., 1813, (2003) Springer- Verlag, Berlin/New York, 123-160. 



51 



[6] V. Bangert: Analytische Eigenschaften konvexer Funktionen auf Riemannschen 
Manigfaltigkeiten. J. Reine Angew. Math., 307 (1979), 309-324. 

[7] P. Bernard & B. Buffoni: Optimal mass transportation and Mather theory. J. 
Eur. Math. Soc, 9 (2007), no. 1, 85-121. 

[8] P. Bernard & B. Buffoni: The Monge problem for supercritical Mane potential 
on compact manifolds. Adv. Math., 207 (2006), no. 2, 691-706. 

[9] Y. Brenier: Polar factorization and monotone rearrangement of vector-valued func- 
tions. Comm. Pure Appl. Math., 44 (1991), 375-417. 

[10] G. BUTTAZZO, M. GlAQUINTA & S. HiLDEBRANDT: One- dimensional variational 
problems. Oxford Lecture Series in Mathematics and its Apphcations, 15 (1998), 
Oxford Univ. Press, New York. 

[11] L. Caffarelli, M. Feldman & R.J. McCann: Constructing optim,al maps for 
Mongers transport problem as a limit of strictly convex costs. J. Anier. Math. Soc, 
15 (2002), 1-26. 

[12] L. A. Caffarelli & S. Salsa, Editors: Optimal transportation and applications. 
Lecture Notes in Mathematics, 1813. Lectures from the C.I.M.E. Summer School 
held in Martina Franca, September 2-8, 2001, (2003) Springer- Verlag, Berhn. 

[13] P.Cannarsa & C.SlNESTRARl: Scmiconcavc Functions, Hamilton- Jacobi Equa- 
tions, and Optimal Control. Progress in Nonlinear Differential Equations and Their 
Apphcations, 58 (2004), Birkhauser, Boston. 

[14] F.H. Clarke: Methods of dynamic and nonsmooth optimization. CBMS-NSF Re- 
gional Conference Series in Apphed Mathematics, 57 (1989), Society for Industrial 
and Applied Mathematics (SIAM), Philadelphia, PA. 

[15] F.H. Clarke: A Lipschitz regularity theorem. Ergodic Theory Dynam. Systems, to 
appear (2007). 

[16] F.H. Clarke & R.B. Vinter: Regularity properties of solutions to the basic prob- 
lem in the calculus of variations. Trans. Amcr. Math. Soc, 289 (1985), 73-98. 

[17] D.L. COHN: Measure theory. Birkhauser Boston, (1980), Mass. 

[18] L.C. Evans : Partial differential equations and Monge-Kantorovich mass transfer, in 
R. Bott ct al., editors, Current Developments in Mathematics. (1997), International 
Press, Cambridge, 26-78. 



52 



[19] L.C. Evans & W. Gangbo: Differential equations methods for the Mongean- 
torovich mass transfer problem. Mem. Amer. Math. Soc. 137 (1999). 

[20] A. Fathi: Regularity ofC^ solutions of the Hamilton- Jacobi equation. Ann. Fac. Sci. 
Toulouse Math. (6), 12 (2003), 479-516. 

[21] A. Fathi : Weak KAM theorems in Lagrangian Dynamics. Book to appear. 

[22] H. Federer: Geometric measure theory. Die Grundlehren der mathematischen Wis- 
senschaften, 153 (1969) Springer- Verlag New York Inc., New York. 

[23] M. Feldman & R.J. McCann: Monge's transport problem on a Riemannian 
manifold. Trans. Amer. Math. Soc., 354 (2002), 1667-1697. 

[24] A. FiGALLi: The Monge problem on non-compact manifolds. Rend. Sem. Mat. Univ. 
Padova, 117 (2007), 147-166. 

[25] J.N. Mather & G. Forni: Action minimizing orbits in Hamiltonian systems, 
Transition to chaos in classical and quantum mechanics, Montecatini Terme, 1991. 
Lecture Notes in Math., 1589 (1994), Springer, Berhn, 92-186. 

[26] W. Gangbo : The Monge mass transfer problem and its applications, in Monge 
Ampere equation: applications to geometry and optimization. Contemp. Math., 226 
(1999), Amer. Math. Soc, Providence, RI, 79-104. 

[27] W. Gangbo & R.J. McCann: The geometry of optimal transportation. Acta 
Math., 177 (1996), 113-161. 

[28] H. Hofer & E. Zehnder: Symplectic invariants and Hamiltonian dynamics. 
Birkhuser Advanced Texts: Basler Lehrbiicher. (1994) Birkhuser Verlag, Basel. 

[29] L.V. Kantorovich: On the transfer of masses. Dokl. Akad. Nauk. SSSR, 37 
(1942), 227-229. 

[30] L.V. Kantorovich: On a problem of Monge. Uspekhi Mat. Nauk., 3 (1948), 225- 
226. 

[31] M. Knott & C.S. Smith: On the optimal mapping of distributions. J. Optim. 
Theory Appl., 43 (1984), 399. 

[32] J.N. Mather: Existence of quasiperiodic orbits for twist homeomorphisms of the 
annulus. Topology, 21 (1982), 457-467. 

[33] R. McCann: Polar factorization of maps on Riemannian manifolds. Geom. Funct. 
Anal., 11 (2001), no. 3, 589-608. 



53 



[34] G. MoNGE: Memoire sur la Theorie des Deblais et des Remblais. Hist, de I'Acad. 
des Sciences de Paris (1781), 666-704. 

[35] J. MOSER: Selected chapters in the calculus of variations, Lectures in Mathematics 
ETH Ziirich, Lecture notes by Oliver Knill, (2003), Birkhauser Verlag, Basel. 

[36] S.T. Rachev & L. Ruschendorf: A characterization of random variables with 
minimum LP'-distance. J. Multivariate Anal., 32 (1990), 48-54. Corrigendum in J. 
Multivariate Anal., 34 (1990), 156. 

[37] S.T. Rachev & L. Ruschendorf: Mass transportation problems. Vol. I: Theory, 
Vol. II: Applications. Probability and its applications. Springer, (1998). 

[38] W. SCHACHERMAYER & J. Teichmann: Characterization of optimal Transport 
Plans for the Monge-Kantorovich-Problem. Proceedings of the A. M.S., to appear. 

[39] V.N. Sudakov: Geometric problems in the theory of infinite- dimensional probability 
distributions. Proc. Steklov Inst. Math. 141 (1979), 1-178. 

[40] N.S. Trudinger & X.J. Wang: On the Monge mass transfer problem. Calc. Var. 
Partial Differential Equations, 13 (2001), 19-31. 

[41] C. ViLLANi: Topics in mass transportation. Graduate Studies in Mathematics, 58 
(2004), American Mathematical Society, Providence, RI. 

[42] C. ViLLANi: Optimal transport, old and new. Lecture notes, 2005 Saint-Flour sum- 
mer school. 

[43] R. ViNTER: Optimal control, Systems & Control: Foundations & Applications, 
(2000), Birkhauser Boston Inc., Boston, MA. 



54 



